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Africa’s elite 


Anew forum promises to bring deserved prestige to outstanding African researchers, 
and demonstrates the continent’s untapped potential. 


tuberculosis? What is space-time? These are among the ques- 
tions being explored by African scientists who last week joined 
together to open the world’s first truly pan-African scientific gathering. 

The Next Einstein Forum (NEF), held in Dakar, Senegal, deserves to 
becomea regular feature of the global science landscape. Its purpose: 
to publicly celebrate and support some of the most outstanding young 
researchers active in, or closely tied to, the continent. 

There were 15 NEF Fellows in Dakar last week — from Egypt, 
Morocco, South Africa, Senegal, Ethiopia and places in between (see 
http://nef.org/nef-fellows). All have proved themselves as scientists, 
and all have deep connections with Africa. Most work on the continent; 
others are based in prestigious institutions in the United States and in 
Europe, forming part of the sort of diaspora on which other regions — 
especially China — have built a thriving research ecosystem. 

The launch included a scintillating show by African singers and 
dancers, and political messages of science-centred ambition by President 
Macky Sall of Senegal and Rwandan President Paul Kagame. But from 
the outset, as seasoned conference attendees agreed, it was the young sci- 
entists who were the freshest and most compelling feature of the event. 

They presented work on a three-wheeled tractor that can negotiate the 
muddy tracks that make up many of Africa's rural roads, and can thresh 
maize (corn) and pump water for irrigation; and a theoretical model for 
the dark energy that drives the Universe’ accelerating expansion. They 
are researching new preventive treatments for cardiovascular diseases 
that are highly prevalent in black populations, and the fundamentals of 
semantic data-analysis. They are working towards databases that use 
artificial intelligence to generate their own hypotheses. And more. 

The meeting was not just a showcase of what is excellent. The 
all-too-familiar challenges faced by Africa’s researchers were thor- 
oughly rehearsed, both by the Fellows and by African, European and 
US policy grandees who sat alongside them on panels. 

Neighbouring countries face drastically different challenges and have 
wildly unequal resources. There is inadequate public and research infra- 
structure anda lack of intellectual-property protection. A culture that 
holds that ‘science and maths are only for the best’ too often hampers 
teachers who wish to encourage science, and widespread assumptions 
that these subjects are only for boys dissuade girls from pursuing science, 
and present discriminatory obstacles to female researchers. 

Despite the challenges, it was striking how the meeting was an 
expression of determination to find the gems among Africa’s researchers 
and support them. Particularly welcome was the sense that fundamental 
research is as essential as work that has direct societal applications. 

A capacity for fundamental science and mathematics is essential 
if ideas and techniques developed elsewhere are to be adapted and 
absorbed in African contexts. As Senegal’s research minister rightly 
emphasized, World Health Organization protocols for hepatitis B vac- 
cination in Africa, originally derived in Asian settings, were changed 
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because of feedback from African biologists. And innovations in 
handling partial differential equations need to be applied not only to 
fundamental physics, but also to water management. 

How to build on what was undoubtedly a successful exhibition of 
talent? The prime backers of the meeting are the African Institute for 
Mathematical Sciences — now spreading its wings in several countries 
from its roots in South Africa (see go.nature.com/9putdt) — and the 

Robert Bosch Foundation, based in Stuttgart, 


“The meeting was Germany. Both deserve credit, and would 
anexpression of do well to ensure that the Fellows grow in 
determination number and that alumni keep coming to the 
to find the gems forum as active members of a quasi-family. If 
among Africa’s that happens, the meeting could develop into 


a prestige event for those inside and outside 
Africa who want to understand and support 
the best of indigenous African research. Credit to Rwanda for hosting 
the next NEF in 2018. 

There is also an opportunity to make the most of the growing 
number of schemes that support younger scientists in Africa, along- 
side the NEF — the National Young Academies and Global Young 
Academy, the Africa Science Leadership Programme at the University 
of Pretoria (see go.nature.com/fkaq9f) and the DELTAS Africa pro- 
gramme of the Wellcome Trust (see go.nature.com/b23xux). 

The drive from the congress centre to the participants’ hotels 
highlighted the disparities between the rich worlds of many attend- 
ees home countries and the streets of Dakar. Poverty and inequality 
can be reduced only step by step; the step represented by this forum 
was significant. It showed what powerful commitment there is to be 
tapped in this emerging generation of young African scientists. The 
venture and the researchers it represents deserve strong support. m 


researchers.” 


Metropolis now 


Growing urbanization is heralding 
anew era of science in the city. 


that had lain idle for almost four decades. The government 
reconvened its Central Urban Work Conference and gave it a 
crucial task — to report back on how to revamp and revitalize the 
nation’s growing, and choking, cities. 
When the expert group issued its recommendations last month, it 
backtracked on many of the country’s previous urban policies that had 
prized growth above all else. The new plan promises denser streets, to 
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break the damaging reign of the car, mixed-use neighbourhoods with 
greater diversity, and more investment in public transport. China has 
long been said to have ‘a thousand cities with the same face’. Now it is 
trying to puta smile on them. 

Amid the scientific and social priorities for the coming years, the 
study and design of cities must be right at the top. Humans are now an 
urban species and have been since city dwellers started to outnumber 
rural folk for the first time almost a decade ago. The trend is set to 
continue, and the United Nations has estimated that 70% of the global 
population will live in an urban environment by 2050. 

What kind of environment will that be? The signs so far are not 
good. Too many people, especially those in rapidly developing (and 
urbanizing) countries experience city life much as Charles Dickens 
put it in his 1850s work Little Dorrit: “Miles of close wells and pits 
of houses, where the inhabitants gasped for air, stretched far away 
towards every point of the compass. Through the heart of the town 
a deadly sewer ebbed and flowed, in the place ofa fine fresh river” 

Scientists are responding. Universities in many of the world’s 
cities — London, New York, Boston, Madrid, Glasgow, Zurich and 
Singapore among them — are leading a new wave of evidence-driven, 
data-rich research that aims to understand what makes cities tick, 
and to keep them running smoothly. Some of these issues, from how 
best city dwellers should move around, to how to protect their water 
and them from it, are discussed in a collection of articles this week in 
a Nature Outlook supplement on urban health and well-being (turn 
to page 306). 

Science and technology has a chequered history in the city. On the 
plus side, great urban visionaries of the past — such as the British 
town planner Patrick Geddes — trained as scientists, and were able to 
bring the ideas of ecology and the natural environment to their social 


tasks. In a less enlightened contribution, some of the haste to design 
cities around the automobile was justified by claims to rational science. 
(Even today, many cities in the developing world spend 70% of their 
transport budgets on serving the car, even though 70% of trips are 

made by foot or public transport.) 
By the 1960s, cities were almost a frontier too far even for science. 
Asked by then-US President Lyndon Johnson to solve the social prob- 
lems rooted in US urban areas, a specially 


“Urban science convened group of scientists in Woods Hole 
has someway to _ in Massachusetts responded that “creating a 
go torestore its safe, happy city is a greater challenge thana 


trip to the moon” Residents of the Bronx in 
New York City would have agreed: a botched 
attempt to model demand for fire services in the 1970s contributed to 
a series of ill-judged fire-station closures and an outbreak of (oddly, 
not predicted) fires. 

Urban science has some way to go to restore its reputation, but the 
era of big data offers an opportunity, and a new way of thinking. Even 
as civic leaders crow about the unique merits of their towns, research 
on cities is trying to dismantle them. Rather than looking at what 
makes cities different — and then planning accordingly — modern 
urban science seeks what they have in common. 

The new models of urban life start from the ground up and track, for 
example, people’s journeys and the reasons for them, rather than the 
flow of traffic through a specific, frequently gridlocked roundabout or 
intersection. They try to make city science quantifiable, testable and 
reproducible. It’s a big ask — but that trip to the Moon was achieved. 
And as the same Woods Hole scientists also told Johnson, the prob- 
lems of the city “nevertheless, can be attacked in the same logical way 
we have gone about exploring the universe”. m 


reputation.” 


Practical DNA 


The promise of DNA origami shows signs of 
coming to fruition a decade after its debut. 


develop tools of investigation and to make useful and some- 

times revolutionary things with which to build our future. And 
every now and again, a piece of science comes along that seems like 
a work of art. 

All of this was exemplified by a research paper published in Nature 
ten years ago that, literally, produced smiles (see Nature 440, 297- 
302; 2006). Using an astoundingly simple and general method to 
assemble strands of DNA into arbitrary shapes, the research gener- 
ated ‘smileys’ that graced the cover of Nature and announced the 
arrival of DNA origami to the world. 

The robustness of this method changed the game for DNA nano- 
technology, which has since developed at an astonishing pace. It isa 
beautiful demonstration of how science can progress. 

The concept behind DNA origami was laid down in the early 
1980s by crystallographer Nadrian Seeman, who realized that the 
ability of DNA molecules to carry and transfer information accord- 
ing to strict base-pairing rules could be used to rationally assemble 
structures with precisely controlled nanoscale features. 

This unprecedented level of programmability makes DNA a unique 
building material. Nanodesigners have embraced the biomolecule to 
fabricate intricate tiled patterns, boxes with lids that can be opened 
and arrays of precisely located binding elements that can incorporate 
proteins, dyes and other functional materials into regular lattices. 

Pivotal to the success of DNA as a nanoscale building material 
have been automated methods to synthesize short DNA molecules 
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of any sequence. A detailed understanding of how base-pairing 
translates into the formation of DNA double helices has also been 
crucial. Such helices control the shapes into which DNA molecules 
with given sequences will fold. 

DNA origami provides the missing ingredient: a versatile yet 
straightforward assembly method. Computer-aided design pro- 
grams determine how DNA scaffolds can be folded to realize desired 
structures, as well as which short DNA strands, or staples, are needed 
to hold the structures in shape. 

Individual structures can also be assembled into more complex 
patterns, and sites that bind to functional materials can be intro- 
duced at any position. 

The many eye-catching structures that have been built have 
pleased those of us with an appreciation of beauty. But even the most 
creative science will ultimately face the question: what is the point? 

DNA nanotechnology has long searched for relevance. It is 
unrivalled in its ability to build complex structures with near-atomic 
precision, but the results tend to be labile, soft and so small that it is 
a challenge to put them to practical use. 

Yet applications that address basic problems in science have 
emerged. DNA structures can serve as tools for determining the 
structures of proteins or as templates for assembling electronic com- 
ponents and basic devices. Responsive DNA structures can target 
diseased cells, and artificial membrane channels formed from DNA 
can act as single-molecule sensors. 

Real-world applications might become feasible through recent 
developments — for example, improvements to the folding process 
that reduce assembly time and boost yield. Initial steps have also 
been taken to efficiently pair DNA nanostructures with technologi- 
cally relevant substrates. 

Many challenges remain, and DNA nano- 
technology is far from maturity. But a growing 
number of scientists are entering the field to 
make more than just art. Watch this space. = 
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WORLD VIEW .pecnnicor sen 


Advancement of Science (AAAS) in Washington DC last month 

was one of the best I’ve witnessed in more than 20 years of regular 
attendance. The policy sessions were packed and genuinely stimulating. 
I met tons of smart, influential people I hadn't seen for ages, and we all 
enjoyed a good chinwag about how better to engage with the public — 
the meeting’s theme for 2016. 

The only trouble was what was going on outside the hotel — in the 
United States and the world at large. 

In fact, the AAAS meeting took place in a sort of semi-conscious 
never-never land. The science-policy crowd talked a great game even 
as the pillars of the republic crashed noisily down around their heads. 

Supporters or representatives of Donald Trump, the likely Republican 
nominee for this November's US presidential elec- 
tion, his extremely conservative rival Ted Cruz, 
or even Bernie Sanders, the Democrat insurgent, 
were simply not involved in these discussions. 
They never are. Senior scientists are instead inex- 
tricably linked to the centrist, free-market political 
establishment that has tended to rule, but which is 
now falling dangerously from public favour. 

It is not just in the United States that this con- 
sensus — and perhaps democracy itself — is in 
danger. Poland has just elected a reactionary 
government that is clamping down on press 
freedom; France is toying with electing far-right 
politician Marine Le Pen to the presidency; and 
the rest of the world’s elected leaders are each 
threatened, to a greater or lesser extent, by eco- 
nomic and migration crises. Populist nationalism 
is on the march again — exemplified by the rise 
of Trump, whose mode of operation does not countenance the opin- 
ions, advice or goodwill of anybody else. 

Not for nothing are people being urged to read Rubicon: The Triumph 
and Tragedy of the Roman Republic, Tom Holland’s summary of how 
Rome fell. The establishment — with which science has habitually 
enjoyed a genial, if subservient relationship — is on the rocks. 

Many laboratory researchers perceive this, I fear, to be someone 
else’s problem. But it isn’t. If the West is really in its decline-and-fall 
stage, its Caligula stage, its Donald Trump stage, then this isn’t just 
an issue for political and financial elites. It’s also a problem for the 
‘experts’ who crawl around after these elites, massaging their egos and 
defending their interests. 

The playwright Bertolt Brecht had a good line on expertise. In his 
plays, doctors, lawyers and other ‘experts’ are 


Ts annual meeting of the American Association for the 


generally portrayed in threes. They squabble DNATURE.COM 
haplessly among themselves, each manoeuvring _ Discuss this article 
into the position that most elevates themselves __ online at: 


in the eyes of their aristocratic paymaster. go.nature.com/5xjnt5 
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~ The elephant in the room 
we can’t ignore 


If Donald Trump were to trigger a crisis in Western democracy, scientists 
would need to look at their part inits downfall, says Colin Macilwain. 


And that, sadly, is the role to which senior scientific leaders have 
sometimes reduced themselves. In the main, they have been happy to 
accept the autocracy of politics and finance, even, like the president of 
the European Research Council, hanging around at the annual meet- 
ing of business leaders at Davos in Switzerland, hoping to pick up 
crumbs from the rich man’s table. 

The problem extends down into the community itself. We like to 
talk about ‘engaging the public, but many scientists really just want 
to talk at them. And too many ordinary scientists hold politicians 
in utter intellectual contempt — even though it is the scientists who 
have chosen a career that allows them to pursue relatively simple 
problems (such as building a machine to detect gravitational waves) 
rather than genuinely difficult ones (such as running a social-care 
programme in a small town). 

And those senior scientists who do engage with 
the government or public — as scientific advis- 
ers, for example — often take up highly political 
positions without acknowledging that they are 
doing so. For example, they support free-trade 
agreements that cede the right of democratic 
governments to control things such as cigarette 
advertising or pesticide use without hard, scien- 
tific evidence. This is a political position that is 
pursued with great dedication by global corpora- 
tions — and that is haplessly bought into by many 
scientists without a thought for its consequences. 

I admit that it is difficult to bring more 
subtle and varied political approaches to the 
table. Groups of researchers that have tried to 
do so — such as the Federation of American 
Scientists and Union of Concerned Scien- 
tists — have struggled to gain traction. Still, there is a fresher, grass- 
roots movement, exemplified by local ‘sceptics’ groups, through 
which younger scientists are trying to make their work relate to 
society’s wider concerns. 

But at the top, there is paralysis: leading scientific organizations do 
little except chase money and reinforce the ruling nexus of politics and 
finance — even since the financial crisis of 2008, which discredited 
the free-market philosophy that underpins that nexus. I argued years 
ago (see Nature 479, 447; 2011) that scientific leaders had failed to 
respond in any meaningful way to that collapse, and I’m still waiting. 

The political structure of the West is in deep trouble, and should it 
fall apart, there will be plenty of blame to go around. Most will go to 
political and financial elites, or to rowdy mobs. But some will belong to 
people in the middle who have taken public funds, defended elites and 
then stood back and watched as democracy got ridden over a cliff. m 


Colin Macilwain writes on science and policy from Edinburgh, UK. 
e-mail: cfmworldview@googlemail.com 
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Gut microbes raise 
heart-attack risk 


Gut microbes produce a 
chemical that enhances 
clotting in the arteries, 
increasing the risk of heart 
attack and stroke. 

Stanley Hazen of the 
Cleveland Clinic in Ohio and 
his colleagues treated human 
platelets, which form blood 
clots, with a compound called 
TMAO. This is made in the 
body from a waste product 
of gut microbes, and has 
been linked to heart disease. 
The team found that TMAO 
made the platelets form 
artery-blocking clots faster. 
The researchers increased 
blood TMAO levels in mice 
by feeding them a diet that 
was rich in choline, a TMAO 
precursor, and found that the 
animals formed clots faster 
than did those with lower 
TMAO levels. 

This effect was not seen 
in animals that lacked gut 
microbes or that were treated 
with antibiotics. When 
intestinal microbes from mice 
that produced high levels of 
TMAO were transplanted into 
mice with no gut microbes, 
the recipients’ clotting risk 
increased. The results reveal 
a link between diet, gut 
microbes and heart-disease 
risk, the authors say. 

Cell http://doi.org/bdb2 (2016) 


Waste gas makes 
liquid fuel 


Waste gases containing carbon 
dioxide can be converted into 
diesel, thanks to a bacterium 
and an engineered yeast. 
Gregory Stephanopoulos 
and his colleagues at the 
Massachusetts Institute of 
Technology in Cambridge 
developed a two-stage 


Order of notes is key in bird calls 


A bird species derives different meanings from 
varying combinations of notes, just as humans 
understand complex meanings from words 


combined in different ways. 


Toshitaka Suzuki of the Graduate University 
for Advanced Studies in Hayama, Japan, and 
his colleagues played recordings of four notes 
— A,B, Cand D — in different orders for the 
Japanese great tit (Parus minor; pictured), which 
normally uses more than ten different notes in 


process that uses bioreactors 
to create liquid fuel out of 
gas mixtures containing 
CO,,. The first stage involves 
the bacterium Moorella 
thermoacetica, which 
converts mixtures of CO, and 
other gases such as carbon 
monoxide or hydrogen into 
acetic acid. An engineered 
yeast, Yarrowia lipolytica, 
then transforms the acetic 
acid into an oil that can be 
turned into diesel using 
existing industrial processes. 
This method, with further 
enhancements to boost 
efficiency, could be used to 
produce fuel from the waste 
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its calls. Playing ABC prompted the birds to scan 
horizontally for predators. On hearing a repeated 
D note, the birds approached the source of the 


calls. ABC-D calls elicited both behaviours, but 


gases that are generated by 
industrial sites such as steel 
mills and coal-fired power 
plants, the authors say. 
Proc. Natl Acad. Sci. USA 
http://doi.org/bdb5 (2016) 


Gene blocks anti- 
tumour response 


A common cancer gene works 
in part by helping tumours to 
evade immune cells. 

Dean Felsher of Stanford 
University in California and 
his colleagues studied the 
effects of MYC — a gene that is 
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playing D-ABC invoked little or no response. 
The authors suggest that the order of the 

notes determines meaning, and say that 

this is the first experimental evidence for 

‘compositional syntax’ in a wild animal. 

Nature Commun. 7, 10986 (2016) 


often overexpressed in cancer 
— ina mouse model of a type 
of leukaemia. They found that 
higher MYC expression levels 
increased the production 
of two proteins, PD-L1 and 
CD47, that help cancer cells 
to hide from the immune 
system. When MYC was 
inactivated, CD47 and PD-L1 
levels dropped and tumour 
size decreased. Tumour data 
from humans showed a strong 
link between levels of MYC 
expression and levels of these 
immune-evasion signals. 
People with cancers that 
overexpress MYC could 
benefit from treatments that 
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boost the immune attack 
against tumours, the authors 
suggest. 

Science http://doi.org/bc7p 
(2016) 


| NEUROSCIENCE 
Altered sensations 
in anxiety 


Anxiety disorders could 
involve not only cognitive, 
but also sensory changes in 
the brain. 

Recent studies have 
suggested that people with 
anxiety, after learning a 
negative stimulus, respond 
negatively to similar but 
neutral stimuli more often 
than healthy people. Rony 
Paz at the Weizmann Institute 
of Science in Rehovot, Israel, 
and his colleagues found 
that individuals with anxiety 
disorders also perceive these 
stimuli less precisely than 
healthy people do. After 
learning to associate a tone 
with either monetary gain or 
loss, participants were asked 
to decide whether a series of 
other sounds were a match to 
the previous ones or were new. 
People with anxiety disorders 
mistook a wider range of 
frequencies for the tones they 
had learned, compared with 
healthy people. Learned tones 
and neighbouring sounds 
triggered brain activity that 
showed greater similarity in 
people with the disorders than 
in healthy people. This effect 
was in the brain’s auditory 
cortex and in the amygdala, 
which processes fear. 

The findings suggest that 
people with anxiety have 
altered perception of certain 
stimuli, the authors say. 

Curr. Biol. http://doi.org/bc3z 
(2016) 


Bacterial toxins 
invite infections 


Certain pneumonia-causing 
bacteria produce compounds 
that help other pathogenic 
bacteria to spread through 
the lungs. 

Bret Sellman at 
MedImmune, a biotechnology 
firm in Gaithersburg, 
Maryland, and his colleagues 
infected mice with a variety 
of bacterial species, either 
individually or in combination 
with Staphylococcus aureus, 
which can cause respiratory 
and other infections. Mice 
that were co-infected with 
S. aureus had higher levels of 
both microbe species in their 
lungs, and were more likely to 
die than animals infected with 
a single pathogen. The team 
found that a protein produced 
by S. aureus, called a-toxin, aids 
the growth of several bacterial 
species by impairing immune- 
cell function. Early treatment 
with an antibody against 
a-toxin helped to eliminate 
S. aureus and prevented other 
pathogens from multiplying. 

The authors suggest that 
antibody-based treatments 
targeting a single bacterial 
species could help some people 
who are infected with multiple 
pathogens. 

Sci. Transl. Med. 8, 329ra31 
(2016) 


How flying 
beetles waterski 


The waterlily beetle exploits 
properties of the interface 
where air and water meet 
to glide quickly across the 
surface of ponds. 

Manu Prakash at Stanford 
University in California 
and his colleagues filmed 
waterlily beetles (Galerucella 
nymphaeae; pictured) at 
3,000 frames per second to 
characterize the mechanics of 
the insect’s unusual mode of 
flight on the two-dimensional 
surface. They found that 
the claws on the beetles’ legs 
remain submerged during 
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flight, anchoring the insects to 
the water. 

Keeping four of their six 
legs on the water, the insects 
use the fluid’s surface tension 
to support their body weight 
and move by flapping their 
hind wings, cruising along 
the water's surface at speeds of 
up to halfa metre per second. 
Moving any faster would 
render them airborne, because 
the ripples they produced on 
the water would release their 
anchors. 

J. Exp. Biol. 219, 752-766 (2016) 


Milky Way’s 
bulging waistline 


The mass of stars in the Milky 
Way’s central bulge (pictured) 
is about 20 billion times the 
mass of the Sun — a much 
higher estimate than in most 
previous studies. 

The central bulge protrudes 
from the Galaxy’s main disk 
like the yolk of a fried egg 
and hosts a large density of 
stars. To estimate the mass 
of those stars, Elena Valenti 
at the European Southern 
Observatory (ESO) in 
Garching, Germany, and her 
team used a catalogue of stars 
ofa particular type. The data 
are from ESO’s Visible and 
Infrared Survey Telescope for 
Astronomy (VISTA) at the 
Paranal Observatory in 
Chile. The team also did 
a deeper study of all the 
stars in a small region of the 
bulge, in part using the Hubble 
Space Telescope. 


Valenti says that this is the 
first study of the bulge based 
entirely on observation, without 
the help of theoretical models. 
Astron. Astrophys. 587, L6 (2016) 


LONGEVITY 


Genetic switches 
for long life 


Researchers have homed in on 
the genetic control points that 
allow nematodes to live longer 
when they are on a low-calorie 
diet. 

A team led by Jing-Dong 
Han of the Chinese Academy of 
Sciences in Shanghai analysed 
gene-expression changes 
over time in the nematode 
Caenorhabditis elegans. The 
worms were subjected either 
to caloric restriction or to 
intermittent fasting, both of 
which extend worm lifespan. 

The team identified changes 
in the expression of various 
genes at different times, with 
metabolism genes responding 
early during the diet, and 
those linked to cell division 
and DNA repair changing later 
on. The researchers then used 
an algorithm to identify three 
sets of genes that regulate this 
altered expression. Changing 
the activity of the pathways 
controlled by these genes 
extended the lifespan of the 
worms, mimicking the effect 
of dietary restriction. 

Cell Metab. 23, 529-540 (2016) 
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AlphaGo victorious 
The world’s leading Go 
player, South Korea's Lee 
Sedol, lost his final match 
in Seoul against Google 
DeepMind’s AlphaGo 
machine on 15 March. The 
tightly fought game brought 
the best-of-five competition 
to an end with four wins 

for the computer versus 

one for the human player. 
Sedol came back from three 
consecutive losses to beat 
the artificial-intelligence 
system in the fourth match, 
but ultimately missed out 
on the US$1-million prize. 
Go originated more than 
2,500 years ago in China and 
involves placing black and 
white counters on a board. 
See page 284 for more. 


Brexit warning 
Physicist Stephen 

Hawking is one of more 
than 150 scientists, 
mathematicians, economists 
and engineers at the 
University of Cambridge, 
UK, who warn of a disaster 
for the nation’s science if 
Britain exits the European 
Union (known as Brexit). 
A referendum to be held 
on 23 June will ask whether 
the country should leave 
the EU. Ina 10 March letter 
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Consecutive months in 
which the global monthly 
temperature record has 
been broken. February’s 
temperature was 1.35°C 
above average for the month. 
A strong El Nifio weather 
system has contributed to 
the record-breaking run. 
Source: NOAA 


Famous killer whale nears end of life 


Tilikum, a killer whale (Orcinus orca) at 
SeaWorld in Orlando, Florida, has an incurable 
lung infection, the theme park’s veterinary team 
has announced. In February 2010, Tilikum 
dragged his trainer Dawn Brancheau into the 
pool and killed her. The whale was also involved 


to The Times, organized by 
protein scientist Alan Fersht, 
the group argues that the 
free movement of workers 
between EU countries helps 
in the recruitment of high- 
quality researchers to the 
United Kingdom. The letter’s 
signatories are all fellows of 
the Royal Society in London. 


Zika meeting 

With the Zika virus still 
spreading rapidly across the 
Americas, the World Health 
Organization (WHO) in 
Geneva held an emergency 
meeting on mosquito 
control on 14-15 March. 
The WHO’s Vector Control 
Advisory Group intends 

to review evidence to 
support new and innovative 
techniques for combating 
the Aedes aegypti mosquitoes 
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that transmit Zika virus, 
along with dengue and 
Chikungunya viruses. These 
techniques include deploying 
mosquitoes that have been 
made infertile through 
genetic modification or 
irradiation. 


PRESS 
Fermat proof prize 


Andrew Wiles has received 
the 2016 Abel Prize for 
mathematics for his solution 
to Fermat's last theorem, 

the Norwegian Academy 

of Science and Letters 
announced on 15 March. The 
problem had stumped some 
of the world’s greatest minds 
for three and a half centuries. 
Wiles, a number theorist now 
at the University of Oxford, 
UK, will receive 6 million 


© 2016 Macmillan Publishers Limited. All rights reserved 


in two deaths in the 1990s, and the story of his 
life in captivity was told in the controversial 2013 
documentary film Blackfish. SeaWorld bought 
Tilikum in 1983; he is thought to be 35 years old. 
The species life expectancy in captivity versus 
that in the wild is still debated by scientists. 


kroner (US$700,000) for his 
1994 proof showing that there 
cannot be any positive whole 
numbers x, y and z such that 
x"+y"=z", if nis greater than 
2. See go.nature.com/yflnxj 
for more. 


Infrastructure map 


The European Commission 
has published its latest 

wish list of the research- 
infrastructure projects that it 
considers most deserving of 
continent-wide support. The 
European Strategy Forum on 
Research Infrastructures road 
map, released on 10 March, 
details 21 facilities across 

all scientific areas to help 
national governments to 
prioritize how they spend 
infrastructure money, and 
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to encourage them to share 
costs and responsibilities. 
New facilities listed in the 
2016 road map include two 

in environmental sciences 

and one in health and food 
sciences, as well as solar and 
neutrino telescopes and an 
infrastructure for scientific 
research into cultural heritage. 


POLICY 


Call to save bees 
The US Government 
Accountability Office (GAO) 
says that US regulatory bodies 
need to do more to protect bee 
populations. In a report made 
public on 11 March, the GAO 
called on the US Department 
of Agriculture (USDA) to 
work more closely with 

other agencies to protect 

bee health. The report says 
that although the USDA has 
upped efforts to monitor 
honeybee colonies managed 
by beekeepers, it does not 
coordinate the monitoring of 
wild, native bees. The report 
also recommends that the 
Environmental Protection 
Agency identifies the mixtures 
of pesticides most commonly 
used by farmers. 


PEOPLE 
Minister keeps title 


German defence minister 
Ursula von der Leyen 
(pictured), who was accused 
in September 2015 of 


TREND WATCH 


The level of atmospheric carbon 
dioxide at the Mauna Loa 
Observatory in Hawaii rose by 
3.05 parts per million (p.p.m.) in 
2015 — the largest annual increase 
since records began 56 years ago, 
says the US National Oceanic and 
Atmospheric Administration. 
After correcting for seasonal 
swings from plant-growth cycles 
in the Northern Hemisphere, the 
average CO, concentration in 
2015 was 400.83 p.p.m. — a 43% 
rise compared to the CO, level 

of around 280 p.p.m. that existed 
during the pre-industrial era. 


hereditary breast or ovarian 
cancer. The database, called 
AmbryShare, was launched on 
8 March by Ambry Genetics, 
a genetic-testing company 

in Aliso Viejo, California 

— making Ambry the first 
private company to release 
its customers information 
for free. The Broad Institute 
of MIT and Harvard in 
Cambridge, Massachusetts, 
has an open-access database 
of more than 60,000 genomes 
collected from the public, but 
AmbryShare’s data currently 


plagiarism in her medical focus on specific diseases. 
dissertation in obstetrics, will Ambry hopes to release up to 
not lose the title of doctor 200,000 aggregated genomes 
or her job. The senate of per year from people with 
Hanover Medical School, various conditions. 

which awarded the title in . . : 

1990, announcedon9 March «Anda vaccine fight 
that its formal investigation The medical charity 

revealed that some passages Médecins Sans Frontiéres 

in von der Leyen’s dissertation (MSF) is challenging 

were copied from original pharmaceutical company 
sources. But these were mostly Pfizer’s application for 

in the introduction, it said, a patent in India on 

and the main body of research pneumonia vaccine 

was original and valid. Since PCV 13, marketed as 

2011, two German federal Prevenar 13 in India. MSF 
ministers have lost their titles says that it wants to allow 
and government posts to other manufacturers to make 
plagiarism charges. the vaccine, and lower its 


cost. The 11 March challenge 


| ——sBUSINESS asserts that the method that 


Pfizer is trying to patent 


Gene data shared is too obvious to deserve 
Researchers and the a patent under Indian law. 
public can now access a Pfizer is reported as saying 
database of anonymized that the complexity of the 
genetic information vaccine justifies the price. In 
from 10,000 people with partnership with the vaccine 


CARBON DIOXIDE LEVELS BREAK RECORDS 


The Mauna Loa Observatory in Hawaii recorded the biggest ever 
increase in the growth rate of CO, concentrations in 2015. 
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SEVEN DAYS | THIS WEEK | 


17-18 MARCH 
Commercializing 3D 
printing for biological 
applications is 
discussed at the second 
Tissue Engineering, 
Biofabrication & 
3D-Bioprinting in Life 
Sciences conference in 
Boston, Massachusetts. 
go.nature.com/rggrat 


21-23 MARCH 

NASA holds a meeting 
in Washington DC to 
develop its technology 
road maps. 
go.nature.com/dhmq2e 


21-25 MARCH 

The annual Lunar 
and Planetary Science 
Conference convenes 
in The Woodlands, 
Texas. 
go.nature.com/qpnoxd 


alliance GAVI, Pfizer has 
reduced the price of Prevenar 
since 2013. 


Mosquito trial 

A proposed field trial 

of genetically modified 
mosquitoes in the Florida 
Keys poses no threat 

to human health or the 
environment, the US Food 
and Drug Administration 
has determined. Members 
of the public have 30 days 

to submit comments on the 
draft assessment, which was 
released on 11 March. The 
Aedes aegypti mosquitoes 
developed by Oxitec of 
Oxford, UK, are engineered 
to produce short-lived 
young to temporarily reduce 
mosquito populations 

and combat diseases that 
they carry. The project 

has received increased 
attention from the media and 
politicians amid concerns 
about the spread of Zika 


virus. 


> NATURE.COM 
For daily news updates see: 
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What Cave genome Joint Europe-Russia The hunt for 
next in wake of computer pinpoints dawn of mission attempts to worlds at the fringes of 
Go victory? p.284 Neanderthals p.286 make history p.288 the Solar System p.290 


Performance boost paves 


way for ‘brain doping’ 


Electrical stimulation seems to boost endurance in preliminary studies. 


BY SARA REARDON 


lite ski jumpers rely on extreme balance 
B= power to descend the steep slopes 

that allow them to reach up to 100 kilo- 
metres per hour. But the US Ski and Snowboard 
Association (USSA) is seeking to give its elite 
athletes an edge by training a different muscle: 
the mind. 

Working with Halo Neuroscience in San 
Francisco, California, the sports group is testing 
whether stimulating the brain with electricity 
can improve the performance of ski jumpers by 


making it easier for them to hone their skills. 
Other research suggests that targeted brain 
stimulation can reduce an athlete's ability to 
perceive fatigue’. Such technologies could aid 
recovery from injury or let athletes try “brain 
doping’ to gain a competitive advantage. 

Yet many scientists question whether brain 
stimulation is as effective as its proponents 
claim, pointing out that studies have looked 
at only small groups of people. “They’re cool 
findings, but who knows what they mean,” says 
cognitive psychologist Jared Horvath at the 
University of Melbourne in Australia. 
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The USSA is working with Halo to judge the 
efficacy of a device that delivers electricity to the 
motor cortex, an area of the brain that controls 
physical skills. The company claims that the 
stimulation helps the brain to build new con- 
nections as it learns a skill. It tested its device in 
an unpublished study of seven elite Nordic ski 
jumpers, including Olympic athletes. 

Four times per week, for two weeks, the skiers 
practised jumping onto an unstable platform. 
Four athletes received transcranial direct- 
current stimulation (tDCS) as they trained; the 
other three received a sham procedure. The 
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> stimulation ultimately improved the athletes’ 
jumping force by 70% and their coordination 
by 80%, compared with the sham group, Halo 
announced in February. 

Troy Taylor, high-performance director for 
the USSA, is encouraged by the results — but 
concedes that they are preliminary. 


PUSHING THE LIMITS OF ENDURANCE 

Another study, presented on 7 March at the 
Biomedical Basis of Elite Performance meeting 
in Nottingham, UK, suggests that tDCS may 
reduce the perception of fatigue. Sports sci- 
entist Lex Mauger of the University of Kent in 
Canterbury, UK, and his colleagues found that 
stimulating the motor-cortex region that con- 
trols leg function allows cyclists to pedal longer 
without feeling tired. 

The researchers stimulated the brains of 
12 untrained volunteers before directing the 
athletes to pedal stationary bicycles until they 
were exhausted. Every minute, they asked the 
cyclists to rate their level of effort. 

Volunteers who received tDCS were able to 
pedal two minutes longer, on average, than were 
those who were given a sham treatment. They 
also rated themselves as less tired. But there was 
no difference in heart rate or the lactate level in 
the muscles between the treatment and control 
groups. This suggests that changes in brain per- 
ception, rather than muscle pain or other body 
feedback, drove the improved performance. 

Alexandre Okano, a biological engineer at 


Federal University of Rio Grande do Norte 
in Brazil, found similar increases in cyclists’ 
performance when he stimulated the brain’s 
temporal cortex, which is involved in body 
awareness and in automatic functions such as 
breathing”. This suggests that the temporal and 
motor cortices are connected in ways that are 
not understood, or that tDCS does not target 
locations in the brain precisely, Okano says. 

These results support the notion that the 
brain manages exertion by collating feedback 
from the body and then slowing muscles to 
prevent fatigue, says Dylan Edwards, a neuro- 
physiologist at Burke Medical Research Institute 
in White Plains, New York®. “Even when you 
think you're exercising as hard as you can, there 
is always some reserve of ability,’ he says. 


TRICKY TESTS 

But Horvath cautions that little is known about 
the long-term effects of stimulating the brain. 
And others are sceptical of the technique’s 
potential to increase performance. Vincent 
Walsh, a neuroscientist at University College 
London, notes that the methods used in tDCS 
studies often differ between research groups — 
and might not always be optimized. 

For instance, the fairly intense amount of 
electricity that Mauger’s team used has been 
shown to sometimes have complex and unin- 
tended effects on the brain’s activity’. 

Replicating such experiments is difficult 
because of variations in how people respond to 


brain stimulation. Some people do not respond 
at all; others might respond only when stimu- 
lated in a certain way. And even an individual's 
response can differ from day to day. Edwards 
says that it is important to map these differences 
if tDCS is to be used therapeutically or for other 
purposes. “We're moving toward customized 
prescription of brain stimulation,” he says. 

Nonetheless, the use of tDCS in sports is 
only likely to increase. Stimulating the motor 
cortex, for instance, seems to increase dexter- 
ity, so videogamers have been quick to take up 
the technique. And it is increasingly easy to 
acquire stimulation devices; Halo has begun to 
market its equipment for the express purpose of 
increasing athletic performance. 

Taylor compares the use of brain stimulation 
by athletes to eating carbohydrates ahead of an 
athletic event, in the hopes of boosting endur- 
ance. “It piggybacks on the ability to learn,’ he 
says. “It's not introducing something artificial 
into the body.” 

But Edwards worries that the availability of 
tDCS devices will tempt athletes to try “brain 
doping’, in part because there is no way to detect 
its use. “If this is real,” he says, “then absolutely 
the Olympics should be concerned about it” = 
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ARTIFICIAL INTELLIGENCE 


What Google’s winning Go 
algorithm will do next 


AlphaGo’s techniques could have broad uses, but moving beyond games is a challenge. 


BY ELIZABETH GIBNEY 


human players, the ancient game of Go 
has joined the growing list of tasks at 
which computers perform better than humans. 
Ina 6-day tournament in Seoul, watched bya 
reported 100 million people around the world, 
the computer algorithm AlphaGo, created by 
the Google-owned company DeepMind, beat 
Go professional Lee Sedol by 4 games to 1. The 
complexity and intuitive nature of the ancient 
board game had established Go as one the 
greatest challenges in artificial intelligence 
(AI). Now the big question is what the Deep- 
Mind team will turn to next. 
AlphaGo’s general-purpose approach — 


f ollowing the defeat of one of its finest 


which was mainly learned, with a few ele- 
ments crafted specifically for the game — 
could be applied to problems that involve 
pattern recognition, decision-making and 
planning. But the approach is also limited. 
“It’s really impressive, but at the same time, 
there are still a lot of challenges,’ says Yoshua 
Bengio, a computer scientist at the University 
of Montreal in Canada. 

Lee, who had predicted that he would win 
the Google tournament in a landslide, was 
shocked by his loss. In October, AlphaGo beat 
European champion Fan Hui. But the version 
of the program that won in Seoul is signifi- 
cantly stronger, says Jonathan Schaeffer, a com- 
puter scientist at the University of Alberta in 
Edmonton, Canada, whose Chinook software 
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mastered draughts in 2007: “I expected them 
to use more computational resources and doa 
lot more learning, but I still didn’t expect to see 
this amazing level of performance.” 

The improvement was largely down to the 
fact that the more AlphaGo plays, the better it 
gets, says Miles Brundage, a social scientist at 
Arizona State University in Tempe, who stud- 
ies trends in AI. AlphaGo uses a brain-inspired 
architecture known as a neural network, in 
which connections between layers of simulated 
neurons strengthen on the basis of experience. 
It learned by first studying 30 million Go posi- 
tions from human games and then improv- 
ing by playing itself over and over again, a 
technique known as reinforcement learning. 
Then, DeepMind combined AlphaGo’s ability 


KIM HONG-JI/TPX/REUTERS 


GOOGLE DEEPMIND 


Professional Go player Lee Sedol (centre) after his 4-1 defeat by Google’s AlphaGo algorithm. 


to recognize successful board configurations 
with a ‘look-ahead search; in which it explores 
the consequences of playing promising moves 
and uses that to decide which one to pick. 
Next, DeepMind could tackle more games. 
Most board games, in which players tend to 
have access to all information about play, are 
now solved. But machines still cannot beat 
humans at multiplayer poker, say, in which 
each player sees only their own cards. The 
DeepMind team has expressed an interest in 
tackling Starcraft, a science-fiction strategy 
game, and Schaeffer suggests that DeepMind 
devise a program that can learn to play differ- 
ent types of game from scratch. Such programs 
already compete annually at the International 
General Game Playing Competition, which is 
geared towards creating a more general type of 
AI. Schaeffer suspects that DeepMind would 
excel at the contest. “It’s so obvious, that ’m 
positive they must be looking at it,” he says. 
DeepMind’s founder and chief executive 
Demis Hassabis mentioned the possibility of 
training a version of AlphaGo using self-play 
alone, omitting the knowledge from human- 
expert games, at a conference last month. The 
firm created a program that learned to play less 
complex arcade games in this manner in 2015. 
Without a head start, AlphaGo would probably 
take much longer to learn, says Bengio — and 


THE GO FILES 


MORE 
ONLINE 


might never beat the best human. But it’s an 
important step, he says, because humans learn 
with such little guidance. 

DeepMind, based in London, also plans to 
venture beyond games. In February the com- 
pany founded DeepMind Health and launched 
a collaboration with the UK National Health 
Service: its algorithms could eventually be 
applied to clinical data to improve diagnoses or 
treatment plans. Such applications pose differ- 
ent challenges from games, says Oren Etzioni, 
chief executive of the non-profit Allen Institute 
for Artificial Intelligence in Seattle, Washing- 
ton. “The universal thing about games is that 
you can collect an arbitrary amount of data,” 
he says — and that the program is constantly 
getting feedback on what’s a good or bad move 
by playing many games. But, in the messy real 
world, data — on rare diseases, say — might be 
scarcer, and even with common diseases, label- 
ling the consequences of a decision as ‘good’ or 
‘bad’ may not be straightforward. 

Hassabis has said that DeepMind’s algo- 
rithms could give smartphone personal assis- 
tants a deeper understanding of users’ requests. 
And AI researchers see parallels between 
human dialogue and games: “Each person is 
making a play, and we have a sequence of turns, 
and each of us has an objective,’ says Bengio. 
But they also caution that language and human 


Updates from | @ Horse-sized dinosaur sheds light on 
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man versus @ Peculiar pattern found in ‘random’ 
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cholesterol mutation to heart disease 


interaction involve alot more uncertainty. 

DeepMind is fuelled by a “very powerful 
cocktail” of the freedoms usually reserved for 
academic researchers, and by the vast staffand 
computing resources that come with being a 
Google-backed firm, says Joelle Pineau, a com- 
puter scientist at McGill University in Mon- 
treal. Its achievement with Go has prompted 
speculation about when an AI will have a ver- 
satile, general intelligence. “People’s minds race 
forward and say, if it can beat a world cham- 
pion it can do anything,” says Etzioni. But deep 
reinforcement learning remains applicable only 
in certain domains, he says: “We are a long, long 
way from general artificial intelligence.” 

DeepMind’s approach is not the only way 
to push the boundaries of AI. Gary Marcus, a 
neuroscientist at New York University in New 
York City, has co-founded a start-up company, 
Geometric Intelligence, to explore learn- 
ing techniques that extrapolate from a small 
number of examples, inspired by how chil- 
dren learn. In its short life, AlphaGo probably 
played hundreds of millions of games — many 
more than Lee, who still won one of the five 
games against AlphaGo. “It’s impressive that 
a human can use a much smaller quantity of 
data to pick up a pattern,” says Marcus. “Prob- 
ably, humans are much faster learners than 
computers.” = 
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HUMAN EVOLUTION 


Ancient DNA pinpoints 
dawn of Neanderthals 


Sequencing of 430,000-year-old DNA pushes back species’ divergence from humans. 


BY EWEN CALLAWAY 


atthias Meyer has just published the 
Me of what may be the world’s 

most wasteful genome-sequencing 
project. In decoding just 0.1% of the genome 
of the oldest DNA ever recovered from an 
ancient human, the molecular biologist at the 
Max Planck Institute for Evolutionary Anthro- 
pology in Leipzig, Germany, threw out enough 
raw data to map the modern human genome 
dozens of times over. 

But the excess was necessary, 
because the DNA in the 
430,000-year-old bones was 
degraded and contaminated. Meyer's 
feat of recovery has revealed that the 
remains, from a cavern in northern 
Spain, represent early Neanderthals 
— and has pushed back estimates 
of the time at which the ancient 
predecessors of humans must have 
split from those of Neanderthals 
(M. Meyer et al. Nature http://dx.doi. 
org/10.1038/nature17405; 2016). 

“Starting such a thing is already 
very ambitious, and managing 
it is even more impressive,” says 
Ludovic Orlando, an ancient-DNA 
researcher at the Natural History Museum 
of Denmark in Copenhagen. “We are really 
reaching the limits of what is possible.” 

The analysis addresses confusion over 
which species the remains belong to. A report 
published in 2013 sequenced a femur’s mito- 
chondrial genome — which is made up of 
DNA from the cell’s energy-producing struc- 
tures that is more abundant in cells than is 
nuclear DNA. It suggested that at least one 
individual identified from the remains was 
more closely related to a group called Den- 
isovans — known from remains found thou- 
sands of kilometres away in Siberia — than 
it was to European Neanderthals (M. Meyer 
et al. Nature 505, 403-406; 2014). 

“It's wonderful news to have mitochondrial 
and nuclear DNA from something that is 
430,000 years old. It’s like science fiction. It’s an 
amazing opportunity,’ says Maria Martinén- 
Torres, a palaeoanthropologist at University 
College London. 

The remains are known as the Sima 
hominins because they were found in Sima 


de los Huesos (Spanish for ‘pit of bones’), a 
13-metre-deep shaft in Spain’s Atapuerca 
mountains. Few ancient sites are as important 
or intriguing as Sima, which holds the remains 
of at least 28 individuals, along with those of 
dozens of cave bears and other animals. The 
hominins might have plummeted to their 
death, but some researchers think they were 
deliberately buried there. 

The Sima hominin skulls have the begin- 
nings of a prominent brow ridge, as well as 


Bone powder from the femur of a 430,000-year-old ‘Sima’ skeleton. 


other traits typical of Neanderthals. But other 
features, and uncertainties around their age 
— some studies put them at 600,000 years 
old, others closer to 400,000 — convinced 
many researchers that they might instead 
belong to an older species known as Homo 
heidelbergensis. 

Confusion peaked when Meyer, his col- 
league Svante Paabo and their team revealed 
the mitochondrial connection to the Den- 
isovans. But they hoped that retrieving the 
skeletons’ nuclear DNA — which represents 
many more lines of ancestry than does mito- 
chondrial DNA, which is inherited solely from 
the maternal line — would clear things up. 


NUCLEAR RECOVERY 

Meyer's team managed to glean nuclear and 
mitochondrial DNA from five Sima samples, 
probably representing different individuals. A 
key factor in their success, says Meyer, was that 
since 2006, archaeologists had carefully refrig- 
erated teeth and shoulder-blade tissue from the 
pit to preserve the ancient DNA — awaiting 
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advanced molecular-analysis techniques. 

The nuclear DNA, Meyer's team reports in 
Nature on 14 March, shows that the Sima homi- 
nins are in fact early Neanderthals. And its age 
suggests that the early predecessors of humans 
diverged from those of Neanderthals between 
550,000 and 765,000 years ago — too far back 
for the common ancestors of both to have been 
Homo heidelbergensis, as some had posited. 

Researchers should now be looking for 
a population that lived around 700,000 
to 900,000 years ago, says 
Martinon-Torres. She thinks that 
Homo antecessor, known from 
900,000-year-old remains from 
Spain, is the strongest candidate 
for the common ancestor, if such 
specimens can be found in Africa 
or the Middle East. 

The team’s latest mitochondrial 
sequences, meanwhile, again con- 
firm the puzzling link between 
the Sima hominins and the Den- 
isovans. Meyer suggests that the 
ancestors of the two groups car- 
ried mitochondrial DNA that is 
reflected in both — but which is 
not present in later Neanderthals. 
This elimination could have hap- 
pened by chance, but Meyer now favours the 
hypothesis that an as yet unknown species 
from Africa migrated to Eurasia and bred with 
Neanderthals, replacing the mitochondrial 
DNA lineages. (Supporting this idea, stone- 
tool technologies spread from Africa to Eura- 
sia around half a million years ago, and again 
250,000 years ago). 

It is hard to rule out these or other ideas 
without new data, says Meyer. The full or 
nearly full genome of a Sima hominin, or 
genetic data from other early Neanderthals, 
would be necessary. 

“It’s fascinating and keeps us all on our 
toes trying to make sense of it all,” says Chris 
Stringer, a palaeoanthropologist at the Natu- 
ral History Museum in London. Stringer says 
that the recovery of such old nuclear DNA 
gives him hope that researchers will be able to 
analyse ancient DNA that stretches even fur- 
ther back in time. “Instead of just being stuck 
with trying to resolve the last 100,000 years,” 
he says, “we can really start to put some dates 
from DNA further down the human tree? m 
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MATHEMATICS 


Fermat proof earns 
theorist Abel Prize 


Andrew Wiles rewarded for cracking historic puzzle. 


BY DAVIDE CASTELVECCHI 


ritish number theorist Andrew Wiles 
B has received the 2016 Abel Prize for his 
solution to Fermat's last theorem — a 
problem that stumped some of the world’s 
greatest minds for three and a half centuries. 
The Norwegian Academy of Science and Letters 
announced the award — considered by some to 
be the ‘Nobel of mathematics’ — on 15 March. 
Wiles, who is 62 and now at the University 
of Oxford, UK, will receive 6 million kroner 
(US$700,000) for his 1994 proof of the theorem, 
which states that there cannot be any positive 
whole numbers x, y and z such that x"+y"=z", 
if nis greater than 2. The award came as a “total 
surprise’, he told Nature. 
That Wiles solved a problem considered 
too hard by so many — and yet a problem that 


is relatively simple to state — has made him 
arguably “the most celebrated mathematician 
of the twentieth century’; says Martin Bridson, 
director of Oxford’s Mathematical Institute, 

which is housed in a 


“Ag human building named after 
beings we Wiles. Although his 
succe ed by trial achievement is now 
’ two decades old, he 

ro mess a continues to inspire 
ens es i othe young minds, some- 
hackewh thing that is apparent 
setbac es 0 when schoolchildren 
succeed. show up at his public 


lectures. “They treat 

him like a rock star,’ Bridson says. “They line 
up to have their photos taken with him.” 

Wiles’s story has become a classic tale of 

tenacity and resilience. While a faculty member 
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at Princeton University in New Jersey in the 
1980s, he embarked on a solitary, seven-year 
quest to solve the problem, working in his attic 
without telling anyone except his wife. He went 
on to make the historic announcement of his 
achievement at a conference in his hometown 
of Cambridge, UK, in June 1993, only to hear 
froma colleague two months later that his proof 
contained a serious mistake. 

With another frantic year of work — and 
the help of one of his former students, Richard 
Taylor, who is now at the Institute for Advanced 
Study in Princeton — he was able to patch up 
the proof. When the resulting two papers were 
published in 1995, they made up an entire issue 
of the Annals of Mathematics'”. 

But after Wiles’s original claim had already 
made front-page news around the world, the 
pressure on the shy mathematician to save his 
work almost incapacitated him. “Doing math- 
ematics in that kind of overexposed way is 
certainly not my style, and I have no wish to 
repeat it,” he said in a BBC documentary in 
1996, still visibly shaken by the experience. 

“Unfortunately, as human beings, we 
succeed by trial and error, he told Nature after 
hearing about his win. “It’s the people who 
overcome the setbacks who succeed.” = 
1. Wiles, A. Ann. Math. 141, 443-551 (1995). 


2. Taylor, R. & Wiles, A. Ann. Math. 141, 553-572 
(1995). 
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Mars launch puts Russia- 
Europe team to the test 


Joint ExoMars mission launches lander and orbiter, witha rover planned for 2018. 


BY ELIZABETH GIBNEY 


either Europe nor Russia 
has ever successfully oper- 
ated a mission on Mars’s 


surface. Now the European Space 
Agency (ESA) and its Russian coun- 
terpart Roscosmos hope to mark a 
first for both organizations with the 
joint ExoMars 2016 mission, which 
launched from the Baikonur Cosmo- 
drome in Kazakhstan on 14 March. 

ExoMars 2016 consists of a lander 
that will study the planet's dust storms 
and an orbiter that will analyse its 
atmosphere, including looking for 
methane. The orbiter will also act 
as a relay for a Mars rover, due to be 
launched in 2018. Each phase will test 
the growing collaboration between 
the two agencies, which have hinted 
at further joint missions, including 
uncrewed and crewed Moon trips. 

ESA designed the orbiter and 
lander projects, but a Russian Pro- 
ton rocket launched them, and they 
carry Russian instruments. “The launch is cru- 
cial because it’s symbolic,” says Oleg Korablev 
of the Space Research Institute, Moscow, who 
is principal investigator for the Atmospheric 
Chemistry Suite on the orbiter. “It’s psycho- 
logically very important.’ 

ESA project scientist Jorge Vago adds: 
“Hopefully this will cement a way of doing 
things that becomes the modus operandi for 
when we do missions together.” Based at the 
European Space Research and Technology 
Centre in Noordwijk, the Netherlands, Vago 
also works on the ExoMars 2018 mission. 

ESA approved the ExoMars concept in 2005; 
a subsequent merry-go-round of collaborators 
eventually resulted in the Europe—Russia col- 
laboration and ExoMars’s unusual two-stage 
format (see ‘Mission merry-go-round). 

The ExoMars 2016 craft — the heaviest Mars 
mission ever launched, at 4,332 kilograms — 
is now bound for the red planet. A favourable 
alignment of Earth and Mars means that the 
craft should reach orbit in 7 months; the orbiter 
and landing module, Schiaparelli, will separate 
before reaching the Martian atmosphere. 

The landing wont involve anything as com- 
plex as NASA's sky crane, which delivered 


A Russian Proton rocket launches ExoMars 2016 from Kazakhstan. 


the Curiosity rover to Mars in 2012. But it is 
ambitious, says Vago, and designed to show 
that Europe can make a controlled landing on 
Mars. Heeding lessons from Beagle 2 (Brit- 
ain’s failed 2003 Mars lander that was oper- 
ated by ESA), the module will use drag from 
the Martian atmosphere to brake, then open 
a parachute and finally fire its thrusters. Dur- 
ing the last 2 metres, Schiaparelli will deploy a 
honeycomb-like crash pad. 

The first lander to set down during dust- 
storm season, Schiaparelli will monitor pressure 
and temperature and image the approaching 
landing site as it descends. On the ground, the 
conical lander has just 2-4 days of battery power 
to perform experiments. 

Its tiny meteorological station DREAMS 
(Dust Characterisation, Risk Assessment, and 
Environment Analyser on the Martian Surface) 
will measure pressure, humidity, temperature, 
wind speed and direction. This represents a 
unique chance to study dust circulation and 
hopefully unravel the mystery of why some 
storms on Mars go planet-wide, says Francesca 
Esposito at the INAF Astronomical Observa- 
tory of Capdiomonte in Naples, Italy, and the 
principal investigator for DREAMS. The lander 
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will also be the first to examine the 
planet’s electric field. Those data will 
feed into Martian climate models 
and could allow scientists to better 
predict future disturbances to com- 
munications on the planet, she says. 


BIOLOGICAL STUDIES 

The higher-profile science — 
including investigating hints of 
Martian biology — will take place 
in the sky on board ExoMars’ Trace 
Gas Orbiter (TGO). That phase will 
begin at the end of 2017, once the 
craft has manoeuvred into a circu- 
lar, 400-kilometre-high orbit. 

While studying Mars’s atmos- 
phere, the TGO’s major task will be 
to follow up on evidence that the red 
planet contains methane, which has 
been associated with active geologi- 
cal processes, as well as biological 
ones. “Is there a seasonality to the 
methane, or are the concentrations 
associated with particular types of 
terrain, for instance?” asks John 
Bridges, a planetary scientist at the University 
of Leicester, UK, who works on the TGO’s ste- 
reo camera. The camera will use 3D images to 
chart geological features; a hydrogen detector 
will map the planet’s subsurface water. 

The orbiter will also serve as a communica- 
tions platform, including for the ExoMars rover, 
which will break new ground — literally. The 
rover will drill up to 2 metres into the surface, 
where organic matter, which can be destroyed 
by surface radiation, may lie preserved. 

The rover project will require the Russian 
and European teams to work together to an 
extent unprecedented for ESA, says Vago. In 
the 2016 mission, their responsibilities are rela- 
tively separate, but in the 2018 mission, he says, 
“there is no clean line’, and each design tweak 
ripples through the work of both teams. It is a 
new experience for Russia too, says Korablev, 
even though the country has long contributed 
scientific instruments to foreign space missions. 
“There are many problems, but there are always 
problems on national projects too,” he says. 

This complexity, coupled with late-running 
instruments, delays in testing and a lack of 
cash, means that the 2018 rover mission 
could be delayed until 2020, says Vago. ESA's 
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director-general, Johann-Dietrich Worner, 
said in January that the 2018 mission needed 
more funding to meet its launch target, and 
the agency is expected to ask member states 
for the missing few hundred million euros at a 
meeting in December. Success with ExoMars 
2016 could help to persuade European leaders 
to contribute. Bridges says that most scientists 
will accept a delay as long as it means that all 
the instruments are on the craft and working. 

Korablev’s involvement with Mars missions 
has been an emotional roller coaster. He spent 
10 years working on Russia’s Mars 96 orbiter, 
which failed to leave near-Earth orbit. He was 
also involved in a sample-return mission to the 
Martian moon Phobos, which ran into prob- 
lems, eventually crashing in the Pacific Ocean 
in 2012. “We put a great effort into ExoMars,” 
he says. “I almost don't dare to say any words.” m 


The countries involved in the ExoMars mission, 
as well as the mission components, have 
changed multiple times since its conception. 


2005 European Space Agency (ESA) 
approves ExoMars as a Europe-only, 
single-rover Mars mission. 


[Gese) a 


2008 Worldwide financial crisis. 


2009 Europe can’t go it alone; NASA 
offers to launch and land rover, and 
wants orbiter added to mission. Europe 
adds a second, stationary lander to 
mission plan because it wants to develop 
landing capability. 


2011 NASA has funding difficulties, 
rumoured to be due to overrun of 
James Webb Space Telescope. 


2012 NASA pulls out of ExoMars; 
Roscosmos steps in, agrees to launch 
orbiter and lander, to launch and land 
rover and contribute instruments to 


both missions. 
| cosa a aw 8 Sw 


Orbiter 
and lander launch. 
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PLANETARY SCIENCE 


NASA Mars woes 
could delay missions 


Decision to defer InSight launch will cost US$150 million. 


BY DEVIN POWELL 


racks in an instrument designed 

to detect earthquakes on Mars 

will add roughly US$150 million 
to the price tag of InSight, NASA's next 
mission to the red planet. But the agency 
said on 9 March that it still intends to 
fly the spacecraft, raising questions 
about how the unexpected expense 
will affect other planetary missions 
in development. 

Although InSight’s launch — originally 
scheduled for this month — is now slated 
for May 2018, it is not clear whether the 
spacecraft’s faulty seismometer will be 
ready in time. InSight seeks to investigate 
Mars’s interior by measuring seismic activ- 
ity, as well as the heat that is escaping from 
the planet and the movement of its surface. 

“We're really grateful that NASA has 
recognized the value of science we're going 
to do and agreed to give us a chance to try it 
again,” says InSight’s principal investigator, 
Bruce Banerdt, who works at NASA’s Jet 
Propulsion Laboratory (JPL) in Pasadena, 
California. 

The spacecraft was developed as part of 
NASA's Discovery Program, which funds 
small, quick-turnaround missions whose 
costs are capped at $450 million. Five pro- 
posed missions are currently vying for a 
chance to launch in the early 2020s. These 
include a trip to Jupiter’s Trojan asteroids, 
two missions to Venus and a camera that 
would detect near-Earth objects. 

“NASA has been trying to choose two 
missions out of this round instead of one, 
and the community’s concern is that the 
likelihood of that happening might be 
falling,” says Linda Elkins-Tanton, a plan- 
etary scientist at Arizona State University 
in Tempe and principal investigator of a 
proposed mission to the asteroid Psyche. 

Jim Green, director of NASA's planetary- 
science division in Washington DC, says 
that budget details and consequences to 
other planetary missions will be worked 
out by August. “Our ability to select at 
least one Discovery mission in December 
is expected to be unaffected, he says. 

Then there is the matter of whether 
InSight’s troubled seismometer, which 
was developed by a global collaboration 


led by the Paris Institute of Earth Physics, 
can be repaired. 

Banerdt says that Sodern, the French 
company subcontracted to build a vacuum 
container to enclose the seismometer’s 
sensors, did not detect any problems with 
connectors that are supposed to seal wires 
leading out of the vacuum housing. Only 
when the instrument was tested in frigid, 
Mars-like temperatures in December did 
cracks in those seals become apparent. The 
project team tried to patch the problem, 
but persistent leaks remained. 

“It’s very frustrating, says Banerdt. 
“[’ve been working on getting this kind of 
mission for more than 25 years, and every- 
thing else on the project was going really 
well” 


FUTURE FIX 

NASA has asked the JPL to craft a new, 
hardier vacuum chamber. The agency’s 
French collaborators will test the cham- 
ber at their own expense. “Personally, Iam 
relieved to know that JPL will be taking 
responsibility for the vacuum chamber,’ says 
Lisa Pratt, a biogeochemist at Indiana 
University Bloomington. 

The Mars InSight team is now re-running 
landing simulations and recalculating orbits 
to account for the updated launch date. 

Some wonder 


“We're really whether the mis- 

tefulthat take could cause 
tear de NASA to tighten 
poe gnize dthe the reins on future 
value of science projects. The most 

‘ recent call for Dis- 
“a covery mission 


proposals — made 
before the problem 
with InSight occurred — mandated that no 
more than one-third of instrument costs be 
spent on foreign sources. 

“The word on the street is that NASA’s 
a little more wary of collaborating with 
groups that they don't know so well or don't 
control directly,” says Elkins-Tanton. 

But Green argues that any nation try- 
ing to build a new instrument could have 
made this mistake. “This is the first time 
this type of instrument has been built to 
withstand harsh environmental conditions 
on another planet,” he says. = 
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ON THE HUNT FOR 
MYSTERY PLANET 


SCIENTISTS ARE SEARCHING FOR AN UNSEEN WORLD 
AT THE FRINGES OF THE SOLAR SYSTEM. 


stronomer Scott Sheppard runs through his checklist as he settles in fora long night BY ALEXANDRA WITZE 
of skygazing at the Subaru telescope atop Mauna Kea, Hawaii. The air above the 

summit: clear. The telescope: working smoothly. His 3-terabyte hard drive: emptied 

and ready to accept a flood of fresh data in the hours to come. 

On a wall in the observing room, three clocks track the hours in Hawaii, Tokyo 
and Coordinated Universal Time. Screens display every tic of the weather above the 
summit: wind direction, temperature and the dreaded humidity levels that could 
end this November night of observing if they were to rise. But for now, conditions 
are nearly perfect, especially when it comes to a characteristic known as seeing — a 
measure of how stable the stars above look. “Seeing is point-five-five,’ says David 
Tholen, an astronomer at the University of Hawaii in Manoa. “It doesn't get much 
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“IF THIS BIG OBJECT 
IS OUT THERE, IT 


BASICALLY CHANGES 
OUR PERCEPTION OF 
THE SOLAR SYSTEM.” 


better than that; replies the third member of this team, Chad Trujillo of 
the Gemini Observatory in Hilo, Hawaii. Sheppard, the lone mainlander 
of the group, works at the Carnegie Institution of Science in Washington 
DC. With the weather looking promising, he pulls out his logbook and 
begins to outline plans for the next ten hours. 

Between twilight and dawn, he will methodically direct Subaru’s 
enormous 8.2-metre mirror — one of the largest in the world — to stare 
deeply at one patch of the sky, then another and another. Several hours 
later, he will look at the same areas for a second time, and after that, a third. 
By comparing the staggered images, the researchers can hunt for objects 
that move ever so slightly over the course of a few hours. These would 
be distant worlds beyond Pluto, in the most extreme reaches of the Solar 
System. This is the realm of the long-sought Planet X. 

Sheppard already has some idea of where to look. On his list of goals 
for the night, he wants to capture fresh images of an object that he first 
spotted one month earlier. At 9:20 p.m., and again at 10:46 p.m., he 
aims Subaru at a region near the constellation Aries, where he thinks 
this object will be. The exposures come off the telescope, and Tholen 
begins to process them. After a few minutes, he beckons Sheppard over. 
On his grey screen is a light-coloured dot that jumps against the field of 
background stars when Tholen toggles between the two images. “There 
it is,” he says. “You got it” 

“Right where it should be,” Sheppard says. It is the same object that 
he saw previously, and it earns an entry in his logbook: in an image 
on computer chip number 104, in field number 776, they have spot- 
ted an object 90 astronomical units (au) away, or 90 times the Earth- 
Sun distance. It’s not clear yet how big the object is or whether it is 
scientifically important — but it is among the most distant worlds ever 
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Astronomer Scott Sheppard 

is searching systematically for 
distant objects — one of which 
could be the elusive Planet X. 


seen in the Solar System. 

Astronomers have discovered more 
than 2,000 exoplanets around other 
stars, mostly through indirect meth- 
ods that detect changes in the distant 
star. Yet the farthest reaches of our Solar System remain largely unex- 
plored: the indirect techniques won't work in our own stellar neigh- 
bourhood, and objects in the distant suburbs of the Sun are too faint 
to be glimpsed by anything but the world’s most powerful telescopes. 
Sheppard and Trujillo have been racing to find the frigid worlds that are 
thought to populate this distant zone. At stake is what could be the last 
great discovery in the Solar System — a planet bigger than Earth that 
may swing around the Sun far past Pluto. 

Suggestions of a Planet X have circulated for more than a century, but 
those hypotheses have always fallen apart after closer scrutiny. In 2014, 
Trujillo and Sheppard revived the concept of a putative Planet X, on the 
basis of the orbits of some extremely distant objects’. In January, the 
idea got a boost when two astronomers from the California Institute of 
Technology (Caltech) in Pasadena calculated more precisely where in 
the Solar System it might lie’. They dubbed it ‘Planet Nine’: a back- 
handed reference to the demotion of Pluto from planet to dwarf planet 
in 2006. 

The hunt is now on to find Planet Nine, or any other unseen 
super-Earths that may lurk out there on other orbits. The quest is 
likely to reveal fundamental insights into how the Solar System formed 
4.6 billion years ago, and how it has evolved since then. “If this big object 
is out there, it basically changes our perception of the Solar System,” says 
Sheppard. “It’s true discovery at its core.” 


UNKNOWN ZONE 

When Sheppard and Trujillo began to chase distant worlds, they were 
hoping to follow in the footsteps of other legendary astronomers. In 
1846, Johann Galle of Germany first spotted Neptune, the eighth planet, 
at a distance of about 30 au, right where it was expected to be according 
to calculations of how it would gravitationally perturb Uranus. In 1930, 
US astronomer Clyde Tombaugh found Pluto, orbiting at a distance 
of around 40 au. And in 1992, astronomers David Jewitt, then at the 
University of Hawaii, and Jane Luu, then at the University of California, 
Berkeley, discovered an even more distant object, beginning the explora- 
tion ofa region of space called the Kuiper belt’. 

Since then, astronomers have found thousands of Kuiper belt objects: 
small icy worlds similar to Pluto that range in distance from about 30 
AU to 50 au from the Sun. Beyond that lies the equivalent of ‘here be 
dragons’ on old maps. Scientists sometimes call it the outermost Kuiper 
belt or the innermost Oort cloud — the next region of the Solar System, 
which is thought to extend to at least 100,000 av. “There's a whole chunk 
of the Solar System we don't fully understand,” says Meg Schwamb, a 
planetary astronomer at the Academia Sinica in Taipei. “It’s one of the last 
unexplored territories” 

Which is why Sheppard and Trujillo are out looking. They met as 
graduate students at the University of Hawaii, where both had Jewitt 
as their adviser. They worked together to hunt Kuiper belt objects, and 
then started a systematic survey to search for still more-distant worlds. 
They are the only team routinely looking for the most-extreme objects. 
“The population could be huge,’ says Trujillo. “That’s why we're doing 
the search” 

By 2012, the two were using the biggest light buckets they could get 
their hands on, with wide-field cameras that would let them view as 
much of the sky as possible. At the Dark Energy Camera, atop a 4-metre 
telescope in Chile, they got a hit almost immediately. On their first night 
of observing, they spotted an object that was moving so slowly that it 
had to be very distant. Thrilled, they watched it move during the course 
of a year, which provided enough data to calculate its orbit. 

They found that the closest it ever gets to the Sun — a measure known 
as its perihelion — is 80 au, beyond the bulk of the Kuiper belt. This 
made it the object with the farthest-known perihelion, just beating the 
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dwarf planet Sedna, whose perihelion is 76 au. 

The discovery of this object, called 2012 VP,,;, led to a Nature paper’ 
— and a lot more observing time on big telescopes. In 2014, Shep- 
pard and Trujillo spent their first nights at Subaru, a facility run by the 
National Astronomical Observatory of Japan that carries a huge camera 
called the Hyper Suprime-Cam. The combination of a big telescope and 
a wide-field camera makes Subaru the world’s best place to scan large 
sections of the sky for faint objects. 

Many scientists work with Subaru remotely: they stay at sea level in 
Hilo, and use videoconferencing to communicate with the telescope’s 
operators. That approach saves researchers from making the 2-hour- 
long journey to the summit of Mauna Kea at 4,200 metres, where the 
atmosphere has 40% less oxygen and causes many people to experience 
dizziness, headaches or sometimes more-serious medical problems. 

But Sheppard likes to be actively involved in directing observations, 
so he always makes the trip. As the hours tick by during the night, he 
stays alert, never once clipping an oxygen sensor onto his finger to see 
how he is coping with the altitude. His logbook fills up with notations: 
field number, chip number, exposure time. He reorders targets on the 
fly, rearranging what he is looking at to improve the time gap between 
the fields. 

Subaru’s huge mirror gazes at the sky, gathering photons for him. 
Exposure times count down in big green numbers on a computer moni- 
tor. When an exposure finishes, an alert dings like a cuckoo clock, and 
Sheppard hovers over the shoulder of the telescope operator to tell him 
where to point the camera next. 

Each good observing night fills up Shep- 
pard’s Macbook with data. To identify 
potential distant worlds, the researchers use 
a programme that Trujillo wrote to pick out 
objects that move between different frames 
of the same star field. But because the pro- 


“EVERY IMAGE — 
YOU NEVER KNOW 


whether it veers inward at its perihelion. 

By far the most prized quarry out there is the hypothesized Planet 
Nine. In their 2014 Nature paper, Trujillo and Sheppard suggested — on 
the basis of the orbits of 2012 VP,,,and Sedna — that an unseen super- 
Earth could lurk at roughly 250 au. This January, Konstantin Batygin 
and Mike Brown of Caltech took these two bodies, along with four other 
distant Kuiper belt objects, and compared their orbits to narrow the 
calculations of where such a planet might lie. 

All six objects share a common orbital property: when they pass clos- 
est to the Sun, they are travelling from north to south relative to the 
plane of the Solar System. If they had no relation to one another, they 
should not all share that orientation. A second line of argument is that 
the six objects are also physically clustered in space (see ‘Out there’). 
“They all point in the same direction and are all tilted at the same angle,” 
says Batygin. “That's odd.” 

He and Brown argue that an unseen Planet Nine must be shepherding 
them into those clusters. It would be between 5 and 10 times the mass of 
Earth, and travel as close as 200 au to the Sun and as far away as 1,200 av. 


FINDING THE NINTH 

Critics say that the argument rests on just a handful of weird Kuiper 
belt objects. “It’s very small statistics,” says David Nesvorny, a planetary 
scientist at the Southwest Research Institute in Boulder, Colorado, who 
nonetheless finds the concept intriguing. “It’s as science should be — at 
the edge of believability:” 

Many astronomers are now running their own 
calculations to estimate the chances that Planet 
Nine exists in this particular orbit, and if not, 
where it might be. Samantha Lawler, of National 
Research Council-Herzberg in Victoria, Canada, 
is working with Nathan Kaib, of the University of 
Oklahoma in Norman, to explore how the pres- 


gramme flags a number of false positives, each 
field must also be reviewed manually. Shep- 
pard goes through every exposure, eyeballing 
the faint dots that the programme circled in 
orange to decide whether they represent a dis- 
tant Solar System object or something else — 


WHAT YOU'RE GOING 
TO GET. IT COULD BE 


ence of a super-Earth might affect the orbits of 
many Kuiper belt objects. Their preliminary 
results suggest that, ifa Planet Nine were out there, 
it should have nudged the orbits of Kuiper belt 
objects in ways that do not reflect reality. 

Planet Nine “is a cool idea, and it would be 


an asteroid or a cosmic-ray blip, perhaps. 

Frame by frame, Sheppard zips through 
thousands of exposures as if he’s playing a 
video game. “It’s exciting to go through,” he 
says. “Every image — you never know what 
you're going to get. It could be the image with 
the super-Earth in it” 

The key is how slowly the objects move. 
Asteroids are relatively close to Earth, and their position in the sky can 
shift by 30 arcseconds, or 0.008 degrees, each hour. Kuiper belt objects, 
which are much farther away, traverse about 3 arcseconds of sky each 
hour. Anything slower than that must be beyond the main Kuiper belt 
and is something that interests the search team. 

Astronomers must observe an object multiple times over the course 
ofa year to pin down its orbit and determine its perihelion. Just because 
an object is remote when it is discovered does not mean that it is scien- 
tifically important. 

For instance, the object that Sheppard spotted at 90 au in November 
may have been at its closest approach to the Sun. If so, that would make 
it a record-breaker, situated beyond Sedna and 2012 VP,,;. Or the object 
might be travelling on a path that takes it much, much closer to the Sun, 
perhaps 40 au. That would make it less exciting, because its perihelion 
distance would place it squarely within the main Kuiper belt — meaning 
that it is just another ordinary Kuiper belt object, as opposed to one of 
the extreme worlds. 

The same is true for an object that the scientists found at 103 au last 
November — the most distant ever observed. They will not know for 
many months whether that body stays in the outer Solar System, or 
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really neat if it was true’, says Lawler. “But you 
have to be really careful.” 

Some answers may come from an ongoing 
project known as the Outer Solar System Origins 
Survey (OSSOS), run by a consortium of inves- 
tigators. It is working to find and study all the 
observable Kuiper belt objects in a small patch 
of the sky in extraordinary detail — by follow- 
ing their orbits, classifying their colours and so on. That work has the 
potential to rule out the existence of Batygin and Brown’s hypothesized 
Planet Nine — if OSSOS were to find a distant object in a region that 
should have been cleared out by the proposed planet. 

Other astronomers have suggested alternative ways to hunt Planet 
Nine, such as looking at data from the Cassini spacecraft orbiting 
Saturn to see whether that planet’s orbit is perturbed ever so slightly, or 
by using cosmological telescopes at the South Pole to detect a planet's 
faint radiation. As Sheppard and Trujillo continue their methodical sur- 
vey of the sky, they are paying special attention to areas where Batygin 
and Brown say the planet could be. And the Caltech pair is chasing it 
as well, also using Subaru. “Id be astonished if there isn’t some kind of 
planet there,’ says Renu Malhotra, a theorist at the University of Arizona 
in Tucson. Ina paper on the preprint server arXiv, she and her colleagues 
put forward a new analysis* of where a super-Earth might lurk, ona 
different orbit from Batygin and Brown's Planet Nine. Malhotra’s team 
uses four extreme Kuiper belt objects to suggest that an unseen planet 
moves around the Sun every 17,000 years. 

But even if a large planet is out there, it will take some luck to find 
it with existing technology. For one of the teams to spot the object, it 
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OUT THERE 


Astronomers are searching for 
extreme worlds in the outermost 
Kuiper belt. Some of the objects 
found so far suggest the existence 
of a large body, dubbed Planet Nine. 


Planet Nine’s mass is 
estimated to be between five 
and ten times that of Earth. 


Earth 


Planet Nine 


/'—SEDNA 
Closest approach to 
the Sun (perihelion): 
76 astronomical Units (AU) 


/ 


Neptune 


would have to be on the larger end ofits estimated size range, or be very 
reflective or in a relatively close-in orbit. If the planet is too small, dark 
and far away, it may never be seen from Earth. “It’s worse than looking 
for aneedle in a haystack,” says Malhotra. “It’s like looking for the broken 
tip ofa needle in a haystack.” 


THE STORY OF A PLANET 

A more fundamental question is not whether Planet Nine exists, but 
what distant objects say about planetary evolution more generally. Dis- 
coveries such as Sedna and 2012 VP, ,; have forced a radical rethinking 
of the gravitational forces that shape the outer parts of the Solar System. 

When astronomers first started to find Kuiper belt objects in the 
1990s and recognized that Pluto was just another member of that clan, 
they began to paint a picture of this mysterious realm of space. The 
Kuiper belt seemed to extend neatly from about 30 au to 50 Au, with 
most objects following stately orbits around the Sun. Those that were 
a bit odd — travelling off-kilter to the plane of the Solar System, or 
occasionally to greater distances — could be explained by gravitational 
interactions with Neptune. 

Sedna and 2012 VP,,; do not fit that simple model because they range 
too far from the Sun to have ever interacted much with Neptune. Theo- 
rists suddenly had to confront the question of how these objects reached 
their current orbits. All known planets in the Solar System, along with 
the Kuiper belt objects, are thought to have condensed from a disk of gas 
and dust that swirled around the newborn Sun 4.6 billion years ago. But 
Sedna and other objects beyond the main Kuiper belt probably weren't 
born where they are today, because there simply wasn't enough gas and 
dust available at those great distances to create sizeable worlds. 

One idea is that they were tossed there by a gravitational battle with 
other protoplanets closer to the Sun during the first tens of millions of 
years of the Solar System's existence. A second theory holds that the grav- 
ity of a passing star tugged on the outer bits of the planet-forming disk, 
pulling nascent planets into elongated orbits, where they remain today. 

If Planet Nine exists, it could complicate this picture even more. It 
would mean that the orbits of Sedna and 2012 VP,,, were not fixed 
early on but are being actively shaped — even today — by the gravita- 
tional tugs of Planet Nine. That would require theorists to rewrite their 
ideas about how the Solar System’s many worlds have interacted with 
one another over the past 4.6 billion years. “It’s hard to anticipate what 
direction our imaginations will go,’ says Malhotra. 

Understanding the distant Kuiper belt could also help astronomers 
to work out how our Solar System compares with planetary systems 


a 2012 VPii3 
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The hypothesized planet 
travels in an elongated 
orbit that takes 20,000 
years for one circuit. 


as PLANET NINE 
Perihelion: ~200 au 
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around other stars. Brown notes that one of the most common types of 
exoplanet is one missing from our Solar System, a world more massive 
than Earth but less massive than Neptune — that is, around the range 
of the hypothesized Planet Nine. “Maybe we can see what this most 
common type of planet might actually look like,” he says. 

For now, scientists’ best shot at answering these questions is to find 
more extremely distant worlds. And that is why Sheppard and Trujillo 
keep plugging away in Chile and Hawaii, having covered less than 10% 
of the sky that they intend to survey. 

Back on Mauna Kea, Sheppard pushes through the night of observ- 
ing, clocking one field after another with no break. By 4:45 a.m., the 
atmosphere above the summit is turning a little more opaque, and he 
begins to shift the exposure times longer. Finally, at 5:25 a.m., he turns 
to the videoconferencing unit and calls to his colleague in Hilo. “Chad, 
are you there?” he asks. “All the fields are in.” The skies above Subaru 
are beginning to brighten, although Sheppard does not get to enjoy 
the spectacular view of a Hawaiian sunrise because he does not step 
outside. He is busy tallying his 33 fields for the night. Any one of them 
could contain a new extreme Kuiper belt object — or even a Planet Nine. 

It is after 7 a.m. when the observing team tumbles into two sports- 
utility vehicles and drives the steep, rocky road down from Mauna Kea’s 
summit. Sheppard starts to flag only when he sits down for breakfast 
at the astronomers dorm, 1,360 metres lower down on the mountain. 
He and Tholen gulp down their food and retreat to black-curtained 
dormitories to sleep until noon. 

Sheppard, aged 40, has asked his doctor about eye strain and whether 
he will be able to keep looking at star fields forever. He and Trujillo have 
a self-appointed goal of finding ten inner-Oort-cloud objects, a num- 
ber that they think will enable them to start testing ideas for how these 
objects formed and evolved. 

That means many more long nights at the telescope. “If it turns into 
postage-stamp collecting, we'll stop, Sheppard says. “But right now, 
every new discovery is a huge difference-maker in trying to understand 
what's going on out there.” m 


Alexandra Witze writes for Nature from Boulder, Colorado. Not one to 
pull an all-nighter, she took a nap at Subaru. 
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THE RED-HOT DEBATE ABOUT 


TRANSMISSIBLE 
ALZHEIMER'S 


A controversial study has suggested that the 
neurodegenerative disease might be transferred 
from one person to another. Now scientists are 

racing to find out whether that is true. 


BY ALISON ABBOTT 


nthe 25 years that John Collinge has studied neurology, he has seen 
hundreds of human brains. But the ones he was looking at under the 
microscope in January 2015 were like nothing he had seen before. 
He and his team of pathologists were examining the autopsied 
brains of four people who had once received injections of growth 
hormone derived from human cadavers. It turned out that some of the 
preparations were contaminated with a misfolded protein — a prion 
— that causes a rare and deadly condition called Creutzfeldt-Jakob 
disease (CJD), and all four had died in their 40s or 50s as a result. But 
for Collinge, the reason that these brains looked extraordinary was not 
the damage wrought by prion disease; it was that they were scarred in 
another way. “It was very clear that something was there beyond what 
youd expect,’ he says. The brains were spotted with the whitish plaques 
typical of people with Alzheimer’s disease. They looked, in other words, 
like young people with an old person's disease. 

For Collinge, this led to a worrying conclusion: that the plaques might 
have been transmitted, alongside the prions, in the injections of growth 
hormone — the first evidence that Alzheimer’s could be transmitted 
from one person to another. If true, that could have far-reaching impli- 
cations: the possibility that ‘seeds’ of the amyloid-f protein involved in 
Alzheimer’s could be transferred during other procedures in which fluid 
or tissues from one person are introduced into another, such as blood 
transfusions, organ transplants and other common medical procedures. 

Collinge felt a duty to inform the public quickly. And that’s what he 
did, publishing the study in Nature in September’, to headlines around 
the world. “Can you CATCH Alzheimer’s?” asked Britain’s Daily Mail, 
about the “potentially explosive new study”. Collinge has been careful 
to temper the alarm. “Our study does not mean that Alzheimer’s is 
actually contagious,” he stresses. Carers won't catch it on the job, nor 
family members, however close. “But it raises concern that some medi- 
cal procedures could be inadvertently transferring amyloid-f seeds.” 
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Since then, the headlines have died away, but the academic work 
and discussion have taken off. Could seeds of amyloid-f proteins 
really be transmitted and, if so, are they harmless or do they cause 
disease? And could seeds of other related diseases involving mis- 
folded proteins be transmitted in a similar way? In the past decade or 
so, evidence has been mounting for a controversial theory that rogue 
proteins, known collectively as amyloids and associated with diverse 
neurodegenerative diseases — from Alzheimer’s to Parkinson’s and 
Huntington’s — might share some properties of prions, including 
their transmissibility. Collinge’s data bolstered that theory. 

Urgent though these questions are, it could take years to find 
answers. The paper by Collinge and his colleagues has sparked a 
worldwide hunt for similar amyloid pathology in autopsied brains, 
and a small study’ published this January revealed a handful of related 
cases. Researchers are also trying to work out what the putative amy- 
loid seeds look like, and whether different ‘strains’ of amyloids exist 
that are particularly damaging. 

Some researchers say that it is much too early to be alarmed. They 
point out that the number of patients in Collinge’s study was tiny, 
that none had displayed symptoms of Alzheimer’s disease before 
their death and that another toxic protein called tau also seems to 
be required to cause the condition. “We have to remember that there 
is no conclusive evidence that seeds of amyloids can transmit actual 
disease or that amyloids spread in the brain in a prion-like way,’ 
says Pierluigi Nicotera, scientific director of the German Centre for 
Neurodegenerative Diseases in Bonn. “There may be other biological 
explanations.” 

Right now, there are few solid answers, but plenty of concerns. 
The sceptics worry that they might one day find themselves working 
under tight biosecurity regulations to handle proteins that they view 
as relatively innocuous. Others feel that the dangers may have been 


underestimated, and that scientists have a duty to investigate this as 
quickly as they can. “In my opinion, all amyloids should be consid- 
ered dangerous until proven safe,’ says prion and amyloid researcher 
Adriano Aguzzi at the University Hospital Zurich in Switzerland. 


DANGEROUS FOLDS 

A few decades ago, it was almost inconceivable that a protein, which has 
no genetic material or any other obvious way to self-replicate, could cause 
infectious disease. But that changed in 1982, when Stanley Prusiner, now 
at the University of California, San Francisco, introduced evidence for 
disease-causing prions, coining the term from the words ‘proteinacious’ 
and ‘infectious’. Prusiner showed that prion proteins (PrP) exist ina 
normal cellular form, and ina misfolded infectious form. The misfolded 
form causes the normal protein to also misfold, creating a cascade that 
overwhelms and kills cells’. It cause animal brains to turn into a spongy 
mess in scrapie, a disease of sheep, and in bovine spongiform encepha- 
lopathy (BSE or ‘mad cow disease’), as well as in human prion diseases 
such as CJD. 

Prusiner and others also investigated how prions could spread. They 
showed that injecting brain extracts containing infectious prions into 
healthy animals seeds disease’. These prions can be so aggressive that 
in some cases, simply eating infected brains is sufficient to transmit dis- 
ease. For example, many cases of variant CJD (vCJD) are now thought 
to have arisen in the United Kingdom in the 1990s after people ate meat 
from cattle that were infected with BSE. 

Since then, scientists have come to appreciate 
that many proteins associated with neurodegen- 
erative diseases — including amyloid-f and tau 
in Alzheimer’s disease and a-synuclein in Parkin- 
son's disease — misfold catastrophically. Struc- 
tural biologists call the entire family of misfolded 
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proteins (including PrP) amyloids. Amyloid-f clumps into 
whitish plaques, tau forms ribbons called tangles and a-synuclein 
creates fibrous deposits called inclusions. 

A decade ago, these similarities prompted neuroscientist 
Mathias Jucker at the University of Tubingen in Germany to 
test whether injecting brain extracts containing misfolded 
amyloid-f into mice could seed an abnormal build-up of amy- 
loid in the animals’ brains. He found that it could, and that it 
also worked if he injected amyloids into the muscles*. “We saw 
no reason not to believe that if amyloid seeds entered the human 
brain, they would also cause amyloid pathology in the same 
way, says Jucker. 

This didn’t cause alarm at the time, because it wasn't clear how 
an amyloid seed from the brain of someone with Alzheimer’s 
could be transferred into another person's body and find its way 
to their brain. To investigate that, what was needed was a group 
of people who had been injected with material from another per- 
son, and the opportunity to examine their brains in great detail, 
preferably when they were still relatively young and before they 
might have spontaneously developed early signs of Alzheimer’s. 

The CJD brains provided just that opportunity. Between 1958 
and 1985, around 30,000 people worldwide received injections of 
growth hormone derived from the adrenal glands of cadavers to 
treat growth problems. Some of the preparations were contami- 
nated with the prion that causes CJD. Like all prion diseases, CJD 
has a very long incubation period, but once it gets going it rages 
through the brain, destroying all tissue in its wake and typically 
killing people from their late 40s onwards. According to 2012 
statistics®, 226 people around the world have died from CJD as 
a result of prion-contaminated growth-hormone preparations. 

Collinge had not set out to find a link with Alzheimer’s — it 
emerged as part of routine work at the National Prion Clinic in 
London, which he heads, and where around 70% of all people 
in the United Kingdom who die from prion-related causes are 
now autopsied. The clinic routinely looks for signs of all amyloid 
proteins in these brains to distinguish prion disease from other 
conditions. It was thanks to this routine work that the cluster of 
unusual cases emerged of people who had clearly died of CJD, but 
who also had obvious signs of amyloid pathology in their grey 
matter and cerebral blood vessels. 

As soon as he saw these brains, Collinge knew that he could 
get into stormy waters. Keen to strike a balance between warning 
of a possible public-health risk and causing unwarranted panic, 
he sketched a carefully worded press release that would go out 
from the National Prion Centre and set up hotlines for people 


“ALL AMYLOIDS SHOULD BE CONSIDERED 
DANGEROUS UNTIL PROVEN SAFE.” 


who had been treated with growth hormone in the past. But no 
panic occurred: apart from one or two overwrought headlines, 
the news stories were fairly measured, he says. Only around ten 
people called the hotlines. 

For scientists, however, the paper was a red flag. “As soon as the 
paper came out we realized the health implications and started 
collecting slides and paraffin blocks from patients,” says Jiri Safar, 
director of the National Prion Disease Pathology Surveillance 
Center at Case Western Reserve University in Cleveland, Ohio. 
Like other pathologists in countries where people had died of CJD 
associated with medical procedures, he rushed to check the cen- 
tre’s archives of autopsied brains to see if any of them contained 
the ominous amyloid deposits. 

The answers are not yet in. Safar says that it has not proved easy 
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to trace brain samples in the United States, but that he is working 
to do so with the National Institutes of Health and the Centers 
for Disease Control and Prevention (CDC) in Atlanta, Georgia. 
Charles Duyckaerts at the Pitié-Salpétriére Hospital in Paris, 
France, has now examined brain tissues from around 24 patients 
and is likely to report the results later this year. 

A further 228 cases of CJD were caused by transplantation of 
prion-contaminated dura mater — the membrane surrounding 
the brain and spinal cord — prepared from cadavers around the 
world. Dura-mater preparations were regularly used in brain 
surgery as repair patches until the late 1990s. For the study” pub- 
lished in January, Herbert Budka at the National Prion Diseases 
Reference Center at University Hospital Zurich and his colleagues 
examined the brains of seven such patients from Switzerland and 
Austria, and found that five had amyloid deposits in grey mat- 
ter and blood vessels. In Japan, dementia researcher Masahito 
Yamada at Kanazawa University is making his way through a large 
number of such autopsy specimens and says that the 16 brains he 
has examined so far show signs of unusually high levels of amyloid 
deposition in cerebral blood vessels. 

Yet such case studies can only ever provide circumstantial 
evidence that seeds of amyloid-B were transferred during the 
treatments. And they cannot entirely rule out the possibility that 
the treatments themselves — or the patients’ original medical 
conditions — caused the amyloid pathology. More-conclusive 
evidence would come from checking whether the original growth 
hormone and dura-mater preparations contained infectious amy- 
loid seeds, by injecting them into animals and seeing whether this 
triggers disease. Most of these preparations, however, have long 
since disappeared. Collinge has access to some original samples 
of growth hormone stored by the UK Department of Health, and 
he is planning to analyse them for the presence of amyloid seeds 
and then inject them into mice. That work will take a couple of 
years to complete, he says. 


SEEDS OF DOUBT 
There is another hitch: no one knows for sure what size and shape 
the amyloid seeds might be. Jucker is hunting for them in an unu- 
sual source of human brain tissue that has nothing to do with 
CJD. A team in Bonn has collected frozen samples from more 
than 700 people with epilepsy who were operated on over the past 
25 years to remove tissue that was driving their seizures. “It is the 
best source of fresh human brain tissue available at the moment; 
says Jucker, who plans to scrutinize it carefully under the micro- 
scope for anything that might resemble tiny clumps or seeds of 
amyloid-f. The team also has records of the patients’ cognitive 
skills, such as language and memory skills, before and at regular 
intervals after the operations. This should allow Jucker’s team to 
correlate the presence of any amyloid-f seeds it finds with changes 
in the cognitive function of individual patients over time. 

Scientists have shown that tau and a-synuclein can also seed 
pathological features in mice. In two studies’* from 2012, sci- 
entists injected fibrils of a-synuclein into the brains of mice 
already engineered to develop some of the characteristics of 
Parkinson’s disease. This triggered the early onset of some of 
the signs and symptoms of Parkinson’s, and eventually killed 
the animals. A third study’ showed that similar injections into 
normal mice caused some of the neurodegeneration typical of 
Parkinson’s disease and the mice became less agile. In humans, 
a-synuclein would not necessarily turn out to be equally aggres- 
sive — mouse models of neurodegenerative diseases do not 
mimic human disease very closely — but scientists are taking 
the possibility seriously. 

If the transmissibility hypothesis proves true, the implications 
could be severe. Amyloids stick like glue to metal surgical 
instruments, and normal sterilization does not remove them, so 
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amyloid seeds might possibly be transferred during surgery. The 
seeds might sit in the body for years or decades before spreading 
into plaques, and perhaps enabling the other pathological changes 
needed to induce Alzheimer’s disease. Having amyloid plaques in 
cerebral blood vessels could be dangerous in another way, because 
they increase the risk that the vessel walls might break, leading 
to small strokes. 

But if common medical procedures really increased the risk 
of neurodegenerative disorders, then wouldn't that already have 
been detected? Not necessarily, says epidemiologist Roy Ander- 
son at Imperial College London. “The proper epidemiological 
studies have not been done yet,” he says. They require very large 
and carefully curated databases of people with Alzheimer’s dis- 
ease, which include information about the development of symp- 
toms and autopsy data. He and his team are now studying the 
handful of reliable databases that exist to tease out a signal that 
might associate medical procedures with Alzheimer’s progres- 
sion. The number of patients currently available may turn out to 
be too small to draw conclusions, he says, but a more definitive 
answer could emerge as the databases grow. 

Faced with so much uncertainty, some researchers and public- 
health agencies have adopted a wait-and-see approach. “We are 
right at the beginning of this story,’ says Nicotera, “and if there is 
one message to come out right now it is that we need more work 
to see if this is a relevant mechanism. The CDC and the European 
Centre for Disease Prevention and Control in Solna, Sweden, say 
that they are keeping a cautious eye on the issue. 

If further research does confirm that common neurodegenera- 
tive diseases are transmissible, what then? One immediate prior- 
ity would be rigorous sterilization procedures for medical and 
surgical instruments that would destroy amyloids, in the way that 
extremely high temperatures and harsh chemicals destroy prions. 
Aguzzi says that funding agencies should put out calls now to 
researchers to develop cheap and simple sterilization methods. 
“Tt’s not very sexy science, but it is urgently needed,” he says. He 
also worries about the safety of researchers working with amy- 
loids — particularly a-synuclein. “I have nightmares that some- 
one in my lab may catch Parkinsons,” he says. “While the story is 
in flux, our first duty is to protect lab workers.” 


STRAIN SEEKERS 

The similarities between prions and other amyloids is throwing 
open other avenues of research. Prions can exist as distinct 
strains — proteins that have the same sequence of amino acids 
but misfold in different ways and have distinct biological behav- 
iours’’, much as different strains of a pathogenic virus can be 
aggressive or weak. The outbreak of vCJD in the United Kingdom 
in the 1990s was traced to BSE-contaminated meat because the 
prion strain was the same in both. 

Over the past few years, research in animals has shown that 
different strains of amyloid-f and a-synuclein exist’. Anda 
landmark paper’ in 2013 reported that strains of amyloid-B 
with different 3D structures were associated with different 
disease progression in two people with Alzheimer’s. Structural 
biologist Robert Tycko, who led the work at the National Insti- 
tute of Diabetes and Digestive and Kidney Diseases in Bethesda, 
Maryland, is now looking at many more brain samples from 
such patients. 

Knowing the structures of pathological forms of amyloid 
seeds should help to design small molecules that bind to them 
and stop them doing damage, says biophysicist Ronald Melki 
at the Paris-Saclay Institute of Neuroscience, who works on 
a-synuclein strains. His lab is designing small peptides that 
target the seeds and mimic regions of ‘chaperone’ molecules, 
which usually bind to proteins and help them to fold correctly. 
Melki’s small peptides mimic these binding regions, sticking 


Alzheimer’s, primarily a disease of elderly people, has complex roots. 


to the amyloid proteins to stop them from aggregating further. 

In the research community, much of the agitation in response 
to Collinge’s paper boils down to semantics. Some scientists do 
not like to use the word ‘prion’ in connection with the amyloids 
associated with common neurodegenerative diseases, or to 
describe any of their properties as ‘prion-like’ — because of its 
connotation of infectious, deadly disease. “The public has this 
perception of the word ‘prion,’ says Alzheimer’s researcher Brad 
Hyman at Harvard Medical School in Boston, Massachusetts, and 
this matters, even if their ideas are wrong. “One of my patients 
told me that she wasnt getting any hugs any more from her hus- 
band who had read about the case in the media — that made me 
sad,’ he says. 

Others, however, feel that it is helpful to consider prions and 
other amyloids as being part of a single spectrum of conditions 
involving proteins that misfold and misbehave. It means that 
researchers studying prion diseases and neurodegenerative 
diseases, who until recently had considered their disciplines to 
be separate, now find themselves tackling shared questions. 

Both fields are wary of raising premature alarm, even though 
they wonder what the future will bring. Jucker, only half-jokingly, 
says he could imagine a future in which people would go into 
hospital every ten years or so and get the amyloid seeds cleared 
out of their brains with antibodies. “Youd be good then to go for 
another decade.” m 


Alison Abbott is Nature’ senior European correspondent. 


Jaunmuktane, Z. et al. Nature 525, 247-250 (2015). 

Frontzek, K., Lutz, M. |., Aguzzi, A., Kovacs, G. G. & Budka, H. Swiss Med. 
Wkly 146, w14287 (2016). 

Prusiner, S. B. Science 216, 136-144 (1982). 

Chiesa, R. PLoS Pathog. 11, e1004745 (2015). 
eyer-Luehmann, M. et al. Science 313, 1781-1784 (2006). 
Brown, P. et al. Emerg. Infect. Dis. 18, 901-907 (2012). 
Mougenot, A. L. et a/. Neurobiol. Aging 33, 2225-2228 (2012). 
Luk, K. C. et al. J. Exp. Med. 209, 975-986 (2012). 

Luk, K. C. et al. Science 338, 949-953 (2012). 

0. Aguzzi, A., Heikenwalder, M. & Polymenidou, M. Nature Rev. Mol. 
Cell Biol. 8, 552-561 (2007). 

11. Cohen, M.L. et al. Brain 138, 1009-1022 (2015). 

12. Peelaerts, W. et al. Nature 522, 340-344 (2015). 

13. Lu, J-X. et al. Cell 154, 1257-1268 (2013). 


Nor 


SO @ SOV CIP OO 


17 MARCH 2016 | VOL 531 | NATURE | 297 


© 2016 Macmillan Publishers Limited. All rights reserved 


FEATURE | NEWS 


“THERES NO CONCLUSIVE EVIDENCE THAT SEEDS 
OF AMYLOIDS CAN TRANSMIT ACTUAL DISEASE.” 
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Team up with industry 


Combining commercial and academic incentives and resources 
can improve science, argues Aled Edwards. 


r | Ahe scientific community is bustling 
with projects to make published 
results more reliable. Efforts are 

under way to establish checklists, to revamp 
training in experimental design, and even to 
fund disinterested scientists to replicate oth- 
ers experiments. A more efficient strategy 
would be to rework current incentives to put 
less emphasis on high-impact publications, 
but those systems are entrenched, and public 
funders and universities are ill-prepared for 
that scale of change. 

To catalyse change, industry must step up 
to the plate. I have learned this first hand, 
as head of the Structural Genomics Consor- 
tium (SGC), a research charity funded by 


business, government and other charities. 
If more companies contributed funds and 
expertise to efforts such as ours, I believe it 
would create a system that rewards science 
that is both cutting-edge and reproducible. 

Here I share my experience of running a 
collaboration between academic and indus- 
trial researchers over the past 12 years, and 
distil the principles that have made for reli- 
able research. 


THEN AND NOW 

In the late 1990s, several drug compa- 
nies concluded that academic researchers 
were only half-heartedly pursuing struc- 
tural information about human proteins, 
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information crucial to the design of new 
drugs. They proposed pooling resources 
with government bodies to fund this 
work. And so, in 2004, drug company 
GlaxoSmithKline, the major UK biomedi- 
cal funder the Wellcome Trust and science 
funders from Canada launched the SGC. 
Our task? To generate human protein struc- 
tures and place them in the public domain. 
The motivation varied. Public and 
charitable funders emphasized scientific 
novelty and high-quality publications. 
Industry wanted research relevant to drug- 
discovery efforts. For example, companies 
insisted on structures of human proteins 
rather than those from other species, 
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> even though many of these would have 
been of great scientific interest. All funders 
demanded quantifiable milestones, unre- 
stricted use of data and reagents, and the 
right to withdraw support from the pro- 
ject if it underperformed (see ‘Milestones 
exceeded’). 

The number of drug companies par- 
ticipating has grown to eight. The SGC 
currently disburses more than US$20 mil- 
lion to 250 scientists in 6 dedicated labo- 
ratories. Our researchers include principal 
investigators, postdocs, technicians and 
graduate stu- 


dents. Atany given “Done well, 
time, as many industry - 

as 50 scientists @eqdemic 

in industry col-  ¢g[laborations 
laborate with SGC could recreate 
scientists. Pro- the engine 
jects now extend that powered 
beyond protein successful 
structures to the 
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ical probes (small 

molecules useful for studying protein func- 
tion), and to hospital-based collaborations 
that explore the effects of these tools in cells 
from patients. All the resources we create are 
non-proprietary and readily available. 

In 2010, as the SGC began to attract the 
interest of pharmaceutical companies that 
had never before participated in consortia 
dedicated to producing public information, 
asked, “Why us?’, hoping to hear about our 
amazing scientific intellect. The answer was 
more prosaic: “We can repeat your work.” 
Back then, before reports emerged that 
fewer than half of biomedical papers could 
be reproduced (L. P. Freedman et al. PLoS 
Biol. 13, €1002165; 2015), that did not make 
much sense to me. It does now. 

Ironically, it was our work with industry 
that had helped make our research so reliable 
in the first place. Our industrial colleagues 
helped to design processes to increase the 
chances that they could depend on our work. 
From the outset, we were operating within a 
system in which continued funding was tied 
to research that proved useful. 


EIGHT PRINCIPLES 

In my opinion — shared by my long- 
standing industrial collaborators — several 
mutually reinforcing factors characterize 
this research system. Each is essential. It 
is the combination, rather than any single 
principle, that is key. 


Require full commitment, and reward 
efficiency. Focus and organization are 
necessary for success. As a precondition 
for receiving funding, scientists agree to 
dedicate all their research time to the pro- 
ject. This is the aspect in which the SGC 
differs most from other schemes involving 
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academic partnerships with industry. For 
participating scientists, the advantage is 
efficient, predictable funding: as long as 
scientists achieve their milestones, expenses 
are covered without the need for grant appli- 
cations. It also incentivizes scientists to 
innovate. If they achieve their targets with- 
out using all their available funds, they can 
use the remainder to pursue their curiosity. 


Define objectives that cannot be achieved 
with current technology. Many scientists 
and public funders believe that formal 
milestones — a requirement for industry 
investment — are the antithesis of discovery 
research. The solution is to create ‘stretch 
goals’ In our case, funders provided a list 
of 2,000 human proteins and a directive to 
solve the structures of 350 — knowing full 
well that that goal was not achievable with 
the technologies of the day. This worked. To 
meet these and subsequent milestones, SGC 
scientists developed new methods and have 
published more than 800 peer-reviewed 
papers (of which 60 have appeared in Nature 
and its eponymous sister journals). 


Establish clear quality criteria and make 
them public. Milestones must be unambig- 
uous, or else they can be gamed. In our case, 
we asked a group of independent experts 
from academia and industry to craft quan- 
titative criteria to judge research outputs. 
For instance, we defined how different a 
protein sequence needed to be from others 
in the Protein Data Bank to count as ‘novel; 
and specified acceptable levels of precision 
and error. All these criteria were published 
on our website (see go.nature.com/4qncn)j). 
SGC scientists knew that protein structures 
would not be ‘counted’ unless they met or 
exceeded the publicly available criteria. 


Mandate data sharing. Mechanisms for 
sharing progress and enforcing transpar- 
ency promote reproducibility. We rolled 
out electronic lab notebooks in 2004. There 
was some initial resistance, but scientists 
quickly realized that ready access to infor- 
mation from their colleagues helped their 
own research. Electronic lab notebooks 
make it easier to document and dissemi- 
nate detailed experimental procedures, and 
also reduce the risk that researchers will 
cherry-pick data. 


Subject work to independent oversight 
before public release. Participants should 
not be the judges of whether they have met 
a milestone. We created an independent 
advisory board of academic and indus- 
try scientists to assess the quality of each 
structure, probe, or other tool before it 
was released into the public domain. This 
external body prevents us from loosening 
the quality criteria for research outputs if 
achieving the original goals turns out to be 
harder than expected. 

This oversight was more tedious and 
humbling than we were used to. Consor- 
tium scientists had to prepare documents 
defending how each criterion had been met, 
and external scientists who intend to use the 
potential tool tend to be tougher judges of 
robustness than the average peer reviewer. 

Over the years, however, external vetting 
became simply part of our practice. With- 
out a doubt, it contributed to our reputation 
for reproducible and meaningful research. 
On two occasions, SGC scientists were con- 
vinced that a chemical probe (see go.nature. 
com/sediul) had met our quality criteria, 
only to have an oversight committee raise 
what turned out to be legitimate concerns, 
which were subsequently addressed. 
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MILESTONES EXCEEDED 


Numbers of structures deposited by the 
Structural Genomics Consortium in the Protein 
Data Bank are consistently above set targets. 


1,600... ® Cumulative target milestones 7 
®@ Achieved milestones 
1,400 ... 


1 — 


1,000-- 
200). are 


Human protein structures 


2007 2011 2015 


Enshrine public ownership for all 
research outputs. The value of foun- 
dational science is greater when more 
people use the research, so materials must 
be disseminated as freely as possible. We 
avoided legal encumbrances such as for- 
mal Material Transfer Agreements (MTAs) 
where possible, developed ‘click-wrap’ 
MTAs where necessary, and encouraged 
vendors to provide all our reagents with- 
out restriction. The result is that many sci- 
entists outside the SGC use our materials. 
Currently, the SGC contributes 5% of all 
the plasmids deposited at the repository 
Addgene; last year, vendors received more 
than 5,000 orders for chemical probes 
generated by the SGC. 


Ensure that industry and academic 
scientists collaborate. Industry must 
provide expertise as well as funds. Collabo- 
ration with industry scientists engenders a 
shared desire to succeed, and creates a sense 
of ownership ofa project. The different moti- 
vations also create productive tension. For 
example, scientists in academia have strong 
incentives to publish rapidly. Unfortunately, 
this can lead to the publication of stories that 
are true only under narrowly defined condi- 
tions. By contrast, industry scientists push 
for validation using a range of orthogonal 
experiments; these alternative ways of evalu- 
ating the same research tool ensure that the 
results are broadly useful. Publications and 
release must happen by consensus. By bal- 
ancing these desires, we achieve an optimal 
combination of innovation, timely dissemi- 
nation and reproducibility. 


Create an active governing body. Many 
academic projects, including a number 
that are co-funded by industry, create 
‘friends and family’ governance bodies that 
advise, but do not govern. The SGC opted 
for a different model. Our board includes 


senior executives from every major funder. 
It can halt a project, change leadership and 
direct strategy. Any meaningful change 
to the project budget requires unanimous 
board approval. Because of this responsibil- 
ity, in-person attendance has been nearly 
100% for every quarterly board meeting 
for more than 12 years. A productive pub- 
lic-private-sector tension also plays out at 
the board. If industry requests a delay in 
releasing research results to further review 
data quality, public funders balance the sci- 
entific merits of this request with the need 
to disseminate the output rapidly. 


EXPAND THE MODEL 

Adherence to these principles builds an 
ecosystem that supports reproducible, 
innovative research. Scientific publications 
are no longer the sole units of achieve- 
ment; reaching predefined milestones 
and making useful tools are also key to 
continued funding. 

This approach cannot work for every 
scientific question or for every scientist. 
But there are many important scientific 
questions in which the long-term needs 
of industry and the talents of academic 
researchers are aligned. Here the research 
effort could be readily focused within 
a single, organized project where unre- 
stricted access to the research results 
would benefit everyone. The oil and gas 
sector might pool resources to develop 
novel remediation technologies; the for- 
estry sector might create a consortium to 
explore cellulose-degrading technologies; 
the renewables sector might collaborate to 
identify better energy-storage technologies. 

Such public-private collaborations are in 
industry’s interest. Companies need foun- 
dational science to innovate and for long- 
term profitability. In the past, researchers at 
BASE, Dupont and Bell Labs invented tech- 
nologies that transformed science, business 
and daily life — from transistors to radar to 
synthetic fertilizers — and they won Nobel 
prizes. Today, support for internal company 
research has been slashed, and academic 
research is not filling the gap because it 
is unpredictable, unfocused and, often, 
unreliable. Done well, industry—academic 
collaborations could recreate the engine that 
powered successful enterprises. 

Such partnerships should be more 
widespread. They can advance important 
areas of research. Equally important, the 
research community gains knowledge 
and tools, and the practices that make 
for robust science might diffuse beyond 
the collaborations to raise the quality of 
science overall. = 


Aled Edwards is chief executive of the 
Structural Genomics Consortium. 
e-mail: aled.edwards@utoronto.ca 
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=< 
Bones from the US Civil War, on display at the Miitter Museum in Philadelphia, Pennsylvania. 


Ethics of exhibition 


David Hurst Thomas explores the controversies over 
collections of human remains and plundered artefacts. 


hree months before the 1941 attack 
ik Pearl Harbor, anthropologist 

Margaret Mead published a prescient 
piece in Natural History entitled ‘Museums in 
the Emergency: Mead found the US citizenry 
“suspicious of every means of communica- 
tion” — except museums. She attributed this 
remarkable faith to the museum practice of 
asking, “Is this true?” rather than, “Will this 
make a hit?” A stubborn insistence on truth, 
Mead believed, could keep the museum as “a 
place in which [people] can renew their trust 
in science and in democracy”. 

Two books look at truth and trust in the 
museum world. In Bone Rooms, historian 
Samuel Redman tracks the evolving role of 
collections of human remains and their pub- 
lic display in framing issues of race. Cultural 
sociologist Tiffany Jenkins crafts a spirited 
read in Keeping their Marbles, with much 
to offer regarding the genesis of the world’s 
great museum collections. She transports the 
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reader from the Napoleonic campaigns that 
stocked the Louvre in Paris with Egyptian 
treasures to British imperialists funnelling 
global booty to London's British Museum. 
Many countries now want their treasures 
back; Nigeria, for instance, wants the return of 
its bronzes, taken when the British Army flat- 
tened the then-kingdom of Benin in the late 
nineteenth century. Both books explore the 
question of who owns the past, with remark- 
ably different answers. Keeping their Marbles 
advocates maintaining the finders-keepers 
mentality that created the museum collec- 
tions. Bone Rooms argues that human remains 
were sometimes inappropriately acquired in 
the name of science, and that meaningful 
steps must be taken to redress the balance. 
Redman documents the US ‘skull wars’ 
of the late nineteenth century, when muse- 
ums competed to collect human skulls, 
whole skeletons, mummies and fossils. 
Battlefield casualties became fair game, as 
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Bone Rooms: From Scientific Racism to 
Human Prehistory in Museums 

SAMUEL J. REDMAN 

Harvard University Press: 2016. 


Keeping Their Marbles: How the Treasures 
of the Past Ended Up in Museums and Why 
They Should Stay There 

TIFFANY JENKINS 

Oxford University Press: 2016. 


did archaeological sites, Native American 
cemeteries and indigenous people unfortu- 
nate enough to pass away at a World’s Fair. 
Even isolated body parts were accessioned, as 
bizarrely illustrated by the Civil War veteran 


who found his own amputated arm in the | 


Army Medical Museum in Washington DC. 

Bone Rooms details the nascent views of 
racial science that evolved in US natural his- 
tory, anthropological and medical museums. 
These debates spilled into public museum 
spaces, arraying human bodies in some- 
times controversial, even macabre, exhibits. 
Redman effectively portrays the remarkable 
personalities behind them, particularly pit- 
ting the prickly Ales Hrdli¢ka at the Smith- 
sonian Institution in Washington DC against 
ally-turned-rival Franz Boas at the American 
Museum of Natural History in New York 
City. Debates over racial science should have 
evaporated when Boas conclusively demon- 
strated that language, culture and biology 
(‘race’) are independent — the premise of 
modern anthropology. But the myth of scien- 
tific racism (typically involving questionable 
concepts such as ‘racial essence’ and ‘racial 
genius’) persisted for a century more, due in 
part to Hrdlicka’s powerful influence over 
widely attended exhibitions that promoted 
the now-discredited idea that races are immu- 
table, with evolutionary change transpiring 
only within (not between) human races. 

Bone Rooms also highlights ethical 
concerns over collecting, curating and exhib- 
iting human remains that simmered in the 
1930s and boiled over after the Vietnam War. 
Native Americans increasingly expressed 
shock at the tens of thousands of ancestral 
skeletons appropriated without their permis- 
sion or, usually, knowledge. The Native Amer- 
ican Graves Protection and Repatriation Act 
of 1990 required museums to inventory hold- 
ings of human remains (plus potential sacred, 
patrimonial and funerary artefacts), then to 
consult tribes about cultural affiliations and 
time and manner of repatriation. Although 
most human remains stayed in storage, tens of 
thousands were repatriated. A few hotly dis- 
puted cases linger on, including Kennewick 
Man (an 8,400-year-old skeleton now held 
in limbo at the Burke Museum of Natural 
History and Culture in Seattle, Washington). 
Redman concludes (correctly, in my view) 
that, on balance, repatriation programmes 
are “successful steps forward”. 

Keeping their Marbles views museum truth 
and trust differently. Jenkins does an excellent 
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job of portraying the extreme reactions 
elicited by repatriation conversations — from 
the smug ‘we-stole-it-fair-and-square’ to the 
angst parodied by historian Elazar Barkan as 
‘performance guilt’ (in which “leaders the- 
atrically say sorry for acts from the past for 
which they had no responsibility”). Although 
granting that great museum collections “were 
wrenched from their original contexts by 
means that often amounted to theft”, Jen- 
kins bristles at returning items. Rather, she 
stresses three principles — preservation, truth 
and access — that determine what is best for 
objects, scholars and the public. “The mis- 
sion of museums,’ she argues, “should be to 
acquire, conserve, research, and display their 
collections... That is all and that is enough.” 
I don't believe that is enough — particularly 
with respect to human remains. As a practis- 
ing dirt archaeologist, I still on occasion exca- 
vate human burials. As a museum curator, I 
sometimes exhibit human remains. But as a 
museum-based researcher, I acknowledge 
my responsibilities to consult meaningfully 
with relevant descendant and stakehold- 
ing communities — and listen to what they 
say. Jenkins is correct that repatriation will 
render some human remains unavailable for 
research. The public display of certain human 
remains is likewise often inappropriate. These 
are limitations that many of us accept today. 
The alternative — the free-ranging, 
science-tiber-alles mentality articulated 
in Keeping Their Marbles — reprises the 
cavalier attitudes towards communities of 
descendants that characterized Americanist 
archaeology for most of the nineteenth and 
twentieth centuries. That sordid legacy, 
which necessitated reburial and repatriation 
legislation in the first place, seems particu- 
larly inappropriate for the responsible prac- 
tice of twenty-first-century science. What 
of Margaret Mead’s belief in modern truth 
and trust? Today’s headlines target different 
‘Museums in the Emergency, from the sys- 
tematic looting of the National Museum in 
Baghdad to the Islamist terrorist group ISIS 
taking sledgehammers to Syrian antiqui- 
ties. The prominent Syrian scholar Khaled 
al-Asaad was beheaded by ISIS for refusing 
to disclose where ancient treasures from 
Palmyra had been hidden for safe keeping. 
The right of museums to hold and display 
collections is today contested at every turn. 
Modern museums have multiple meanings, 
objectives and constituencies. But one thing 
is certain: nowhere is there now a museum 
where all people “can renew their trust in sci- 
ence and in democracy”. No matter where 
the Parthenon Marbles end up, that museum 
world has become an artefact of the past. = 


David Hurst Thomas is a curator of 
anthropology at the American Museum of 
Natural History, New York City. 

e-mail: thomasd@amnh.org 


Books in brief 


Eruption: The Untold Story of Mount St. Helens 

Steve Olson W. W. NORTON (2016) 

When Mount St Helens in Washington state erupted in 1980, it woke 
the nation from political and economic torpor. The huge sideways 
blast — a pyroclastic flow — killed 57, triggered the largest landslide 
ever recorded and spewed ash over 11 states and several Canadian 
provinces. Steve Olson intercuts stories of victims including David 
Johnston, the volcanologist who was monitoring the explosion, with 
an account of its impact on science — such as popularizing the use 
of lidar. With 1,500 potentially active volcanoes worldwide, this is an 
urgent reminder of the need for advances in the field. 


And the Sun Stood Still: A Play 

Dava Sobel BLOOMSBURY (2016) 

The centrepiece of science writer Dava Sobel’s acclaimed 2011 
history A More Perfect Heaven (Bloomsbury) is a dramatized telling of 
a crucial meeting: the 1539 encounter between Nicolaus Copernicus 
and German mathematician Georg Joachim Rheticus, who would 
broker the publication of the Polish astronomer’s great treatise on 
heliocentrism, De Revolutionibus. Now reworked as a play, Sobel’s 
imaginative exploration of how Rheticus convinced the “starry canon” 
to air his theory is a revelation of world-shifting science illuminating 
the human mind, leavened with a sparkling immediacy. 


When We Are No More: How Digital Memory Is Shaping Our Future 
Abby Smith Rumsey BLOOMSBURY (2016) 

A door is opening on a frightening prospect: the future of history. So 
notes scholar Abby Smith Rumsey in this erudite treatise on how 

the digitization of archival technology makes it all too easy to rewrite 
our cultural past. She analyses our journey in recorded memory, 
interweaving neuroscience with a history of the archive, and ranging 
from classical mnemonic devices to the collective amnesia that can 
follow the destruction of libraries. Books, she shows, are “memory 
machines” that we have learned to manage. Digitized data in toto is a 
different beast — and one bucking under our attempts at control. 


Blood Oil: Tyrants, Violence, and the Rules That Run the World 
Leif Wenar OXFORD UNIVERSITY PRESS (2016) 

Petroleum is truly globalized. But for exporting countries such as 
Nigeria, high-demand raw materials can be a “resource curse”, 
linked to political corruption and socio-economic inequality (see 
J. Vidal Nature 482, 306; 2012). In this straight-talking manifesto, 
philosopher Leif Wenar draws on economics and political science 
to call for a rethink on global supply chains. Clean trade policies to 
protect public property and accountability are needed, he argues, 
if poorer nations are to achieve resource sovereignty and Western 
importers are to stop buying blindly into oppressive regimes. 


Throwing Rocks at the Google Bus 

Douglas Rushkoff PORTFOLIO (2016) 

Technology writer Douglas Rushkoff delivers an incisive analysis 

of digitized culture in this shrewd study of the economic rot at its 
heart. Issues such as the corporate growth model and “platform” 
monopolies are, he shows, threatening the public good. He suggests 
that the rage of protestors who attacked shuttle buses carrying 

Google employees in 2013 would be better channelled into “digital 
distributism” — an economy that hinges on democratic ownership of 
the means of production, cooperatives and genuine sharing. Barbara Kiser 


17 MARCH 2016 | VOL 531 | NATURE | 303 
© 2016 Macmillan Publishers Limited. All rights reserved 


| COMMENT | BOOKS & ARTS 


ENTOMOLOGY 


Nabokov’s scientific artistry 


Vladimir Lukhtanov delights in a treatise on the luminary’s contribution to biology. 


ladimir Nabokov’s influ- 

\ / ence on Russian and Eng- 

lish literature and language 

is assured. Many people also know 

of the novelist’s lifelong passion for 

butterflies. But his notable contribu- 

tions to the science of lepidopterol- 

ogy and to general biology are only 
beginning to be widely known. 

Nabokov was no amateur ento- 
mologist. He served for six years as 
curator of the butterfly collection 
at Harvard University’s Museum 
of Comparative Zoology in Cam- 
bridge, Massachusetts, and pub- 
lished a dozen papers on taxonomy 
— the description and classification 
of organisms — that remain impor- 
tant. His observations on butterfly morphol- 
ogy have stimulated breakthrough research 
in evolutionary biology. Several of his origi- 
nal biogeographic hypotheses have been 
confirmed in the past few years. Fine Lines, 
a collection edited by Stephen Blackwell 
and Kurt Johnson, explains the importance 
of Nabokov’s scientific work and traces its 
influence on his novels. 

The book begins with 154 of Nabokov’s 
black-and-white and colour drawings of 
butterflies’ fine anatomical structures. 
Most represent the European, Asian and 
American species of the ‘blues’ of the tribe 
Polyommatini, Nabokov’s favourite group. 
Ten essays follow, by prominent research- 
ers including evolutionary biologist James 
Mallet, current Harvard butterfly curator 
Naomi Pierce and lepidopterist Robert Pyle, 
explaining the interplay of science and art in 
Nabokov’s writings. Fine Lines clearly dem- 
onstrates the significant impact that science 
had on Nabokov’s evolution as a writer. 

The decision to open the book with the 
drawings is a masterstroke. They illustrate one 
of the most important aspects of Nabokov’s 
creativity — his tremendous attention to 
details, described with scrupulous preci- 
sion. In his novels, he seamlessly marshals 
minutiae — impressions, passing fancies, 
ideas — to create a universe strongly rooted 
in observation. The particular or apparently 
trivial was, for him, always worth probing. In 
his entomological studies, he analysed fine, 
nearly invisible, dots on the wings of New 
and Old World butterflies to hint at what 
may have happened on Earth millions of years 
ago. With no palaeontological data, Nabokov 
speculated that North and then South Amer- 
ica were populated by five waves of butterflies 
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One of Vladimir Nabokov’s drawings of the undersurfaces of butterfly wings. 


migrating from Asia (V. Nabokov Psyche 52, 
1-61; 1945) — a picture confirmed by DNA 
analysis almost 70 years on (R. Vila et al. Proc. 
R. Soc. B278, 2737-2744; 2011). 

This pointillism is harder than it seems. 
Piling up millions of elements can easily end 
in chaos; to create a picture, one needs to 
understand the nature of these elements and 
to be able to choose between them. The core 
of scientific drawing differs greatly from pho- 
tography in focusing on the heart of the mat- 
ter and avoiding unnecessary details. This is 
important for science, and no less for art. 
Both have the same central goal — to reveal 
an unknown or invisible essence of things. 
That is one of the main points of Fine Lines. 

Yet science and art diverge in their com- 
munication. In science, the ability to convey 
the idea properly and simply is a matter of 
special talent, but almost everyone can learn 
to do it. Not so in art. Nabokov’s drawings are 
scientifically perfect, but also staggeringly 
fine aesthetically. They 
show how the merging 
of content and form 
in art conveys ideas 
wonderfully. However, 
even the most wonder- 
ful idea becomes banal 
ifartistry is lacking. 

The personal, 
artistic and scientific 


aspects of Nabokov’s Fine lines: 
life were tightly inter- Vladimir 
twined. As one ofthe Nabokov’s 
book’s essayists, sci- Scientific Art 
ence writer Dorion "O1D BY STEPHEN 
H. BLACKWELL AND 
Sagan, concludes,  jijer jounson 
nature andart werea yale University Press: 
continuum for him: 2016. 
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“the distinct but equally necessary 
paths of art and science seem to scale 
opposite sides of the same majestic 
mountainscape”. 

Nabokov’s fiction is permeated by 
science, as Fine Lines amply reveals. 
He was a master in the use of motif 
and symbols. In his novel Lolita 
(Olympia, 1955), for instance, the 
town Lepingville is named after 
‘lepping, butterfly hunters’ slang for 
chasing butterflies, and Elphinstone 
after Elphinstonia, a subgenus in the 
white butterfly genus Euchloe. The 
fictional play-within-the-novel, The 
Enchanted Hunters, is built almost 
entirely on symbols associated with 
butterflies. Diana, its protagonist, 
is both the virgin goddess of hunting and 
a butterfly species (Speyeria diana). In his 
essay, Nabokov scholar Brian Boyd reveals 
that Edusa Gold, who directs the play, is an 
echo of Colias edusa, an old but preocccupied 
name for the clouded yellow (now Colias cro- 
ceus). I can add that her sister Electra Gold 
was named after Colias electra, an unavailable 
name for the African clouded yellow, now 
Colias electo. That these names are effectively 
hidden — no longer in use, but buried in lists 
of unavailable scientific epithets — chimes 
with the secrecy in this controversial novel. 

I prepared this review at the Nabokov 
House Museum in St Petersburg, Russia. 
While there, I discovered in the Nabokov 
family’s copy of An Illustrated Natural 
History of British Butterflies and Moths by 
Edward Newman (William Glaisher, 1870) 
that Nabokov had, as a child, coloured in the 
black-and-white image of the clouded yellow 
with remarkable accuracy. As zoologist Victor 
Fet describes in Fine Lines, Nabokov’s child- 
hood concentration on butterfly collecting 
and drawing effectively provided very specific 
training in memory and paying attention, as 
well as that focus on minute detail. 

Few have so beautifully and meaningfully 
meshed serious scientific endeavour with 
artistic brilliance, visual and verbal. Fine 
Lines helps us to understand the phenom- 
enon of creativity, without which neither 
good science nor true art can exist. m 


Vladimir Lukhtanoyv is leading research 
scientist at the Zoological Institute of 

the Russian Academy of Sciences in 

St Petersburg, and professor of entomology 
at Saint Petersburg State University. 
e-mail: lukhtanov@mail.ru 
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Short-sighted to cut 
environment posts 


Australia’s premier government- 
funded science agency, CSIRO, is 
shedding jobs in environmental 
science — 100 scientists from 
the climate-sciences division 
alone face imminent job loss. The 
perception that environmental 
research is unprofitable has 
already rendered it victim 

to four years of government 

cuts. Evidently, the promise 

of significant savings from 
science-based environmental 
remediation has yet to resonate 
with decision-makers. 

Other major CSIRO job losses 
affect research on sustainable 
management of the nation’s 
terrestrial and aquatic ecosystems 
and biodiversity. James Cook 
University in Queensland intends 
to axe 25% of its academic staff 
in the environmental sciences. 
Effective research organizations 
such as Land and Water Australia 
have been disbanded. And past 
cuts have already translated 
into weakened environmental 
regulation and management. 

Australia is vulnerable to 
climate-change effects and has 
rapid population growth. It leads 
the world in recent extinctions 
of terrestrial mammals, and has 
vast areas that are in urgent need 
of restoration after widespread 
intensive land clearing, livestock 
overgrazing and mining. 

Anti-environment policies 
and further destruction of 
Australia’s research capability are 
threatening to destroy its priceless 
natural heritage. 

David Lindenmayer The 
Australian National University, 
Canberra, Australia. 
david.lindenmayer@anu.edu.au 


China draws lines to 
green future 


To conserve the strategic integrity 
of its environment, China has 
drawn up a system of ‘Red Lines: 
These denote the total minimum 
areas of various land-use types 
nationally and regionally, without 


specifying their exact locations. 
Coming after Red Lines that 
were created to protect cropland 
and forest habitats, the latest 
Red Line will safeguard China's 
vast biodiversity, environmental 


resources and ecosystem services. 


This could consolidate the shift 
in the country’s environmental 
strategy, which is moving away 
from networks of protected 
areas and short-term ecological 
restoration towards longer-term 


conservation of entire landscapes. 


We propose that the area 
marked by the latest Red Line 
(see go.nature.com/na6ry6; in 
Chinese) should equal at least 
496 million hectares. This would 
incorporate the areas covered by 
China’ existing nature-reserve 
network and three recent, 
overlapping, landscape-scale 
conservation schemes. Within 
these, ‘priority biodiversity 
conservation areas, ‘important 
ecosystem-function areas’ and 
‘key ecosystem-service function 
areas have been designated for 
flood protection, erosion control, 
biodiversity conservation and 
ecosystem-service provision. 

Direct government payments 
for ecosystem-service provisions 
and adjustments to imbalances 
in designations, particularly in 
China’s eastern provinces, should 
underpin this initiative. 

Sang Weiguo Minzu University 
of China; and Institute of Botany, 
Chinese Academy of Sciences, 
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Six principles for 
EU peer review 


The European Union Agencies 
Network for Scientific Advice 
(EU-ANSA) is a group of 11 EU 
agencies that provides scientific 
information for institutions and 
national authorities in Europe. It 
has recently assessed its peer- 
review practices and drawn up 
guidelines that we hope will 
support the agencies’ work and 
contribute to the debate on 


changes to peer-review methods, 
particularly when there are 
implications for policymaking. 

The context in which EU 
agencies provide scientific advice 
and technical support is highly 
specific. It requires a special 
approach to peer review that 
addresses particular challenges, 
such as ensuring reviewers’ 
motivation, independence and 
international perspective. 

The EU-ANSA collaboration 
has outlined six guiding 
principles (see go.nature. 
com/bqb7jp). These focus on: 
defining the process so that it 
is widely accepted; recruiting 
high-quality expertise; achieving 
credibility and avoiding bias 
using a transparent process; the 
provision of adequate resources; 
the availability of adequate 
technical support; and integration 
with established agency processes 
for ensuring good quality and 
performance in work. 

William Cockburn European 
Agency for Safety and Health at 
Work, Bilbao, Spain. 

Hubert Deluyker European Food 
Safety Authority, Parma, Italy. 
cockburn@osha.europa.eu 


Hasty publication 
compromises rigour 


The period between submitting 
a paper and its publication can 
sometimes exceed the time 
invested in achieving the results 
(see Nature 530, 148-151; 
2016). Meanwhile, acquiring 
funding still hinges on clocking 
up publications in journals 
with high impact factors. These 
pressures pose a difficult choice 
for researchers, especially those 
in the early stages of their careers. 
To compete for a faculty 
position or an independent grant, 
a PhD student must publish a 
handful of papers and then keep 
the ball rolling as a postdoc with 
at least one paper every year. 
The importance of publishing 
fast is hammered home during 
this period when, in our view, 
the emphasis should instead be 
on the reproducibility of results. 


In this battle for survival and 
growth, it is scientific rigour that 
might pay the price. 

Even improvements in 
performance indicators — using 
citation counts, for example — 
are of little help to an early-career 
researcher. We need to move 
away from the ‘publish or perish’ 
ethos and towards incentives that 
reward scientific quality. 
Shraddha Madhav Karve Indian 
Institute of Science Education and 
Research, Pune, India. 
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Biodiversity central 
to food security 


At the fourth plenary session 
of the Intergovernmental Panel 
on Biodiversity and Ecosystem 
Services (IPBES) last month, 
the thematic assessment on 
sustainable biodiversity use was 
referred for a second scoping 
by experts. I suggest that the 
new analysis needs to include 
biodiversity’s contribution 

to ecosystem services that 

are essential to agricultural 
sustainability and food security. 

Agricultural ecosystems are 
directly linked to human and 
environmental health (D. Tilman 
and M. Clark Nature 515, 518- 
522; 2014). They are essential to 
the IPBES ambition to influence 
progress on the United Nations 
Sustainable Development Goals. 
Furthermore, by 2050, the food 
supply for 9.6 billion people will 
depend on the sustainable use of 
agricultural biodiversity and its 
multiple ecosystem services (see 
go.nature.com/7ympnb). 

A narrow scope that focuses 
on harvesting wild, uncultivated 
species will fail to capture 
biodiversity’s importance to 
ecosystem services. Instead, we 
need a systems-based approach 
(see J. Liu et al. Science http://doi. 
org/627; 2015). 
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Dry-season greening of Amazon forests 


ARISING FROM D. C. Morton et al. Nature 506, 221-224 (2014); doi:10.1038/nature13006 


Evidence from ecological studies!?, eddy flux towers* >, and satellites*® 


shows that many tropical forests ‘green up’ during higher sunlight 
annual dry seasons, suggesting they are more limited by light than 
water. Morton et al.’ reported that satellite-observed dry-season green 
up in Amazon forests is an artefact of seasonal variations in sun-sensor 
geometry. However, here we argue that even after artefact correction, 
data from Morton et al. show statistically significant increases in canopy 
greenness during the dry season. Integrating corrected satellite with 
ground observations indicates that dry-season forest greening is 
prevalent in Amazonia, probably reflecting large-scale seasonal 
upregulation of photosynthesis by canopy leaf dynamics. There is 
a reply to this Brief Communication Arising by Morton, D. C. et al. 
Nature 531, http://dx.doi.org/10.1038/nature16458 (2016). 

Variations in sun-sensor geometry induce artefacts in remotely 
sensed vegetated surfaces®. Satellite studies thus typically use models 
to correct artefacts (for example, Moderate Resolution Imaging 
Spectroradiometer (MODIS) leaf area index”, and multiangle imple- 
mentation of atmospheric correction (MAIAC) enhanced vegetation 
index'® (EVI)) or compositing algorithms designed to minimize 
artefacts (standard MODIS EVI'!). Morton et al.” used a modelling 
approach to correct MODIS satellite data, which they state removed 
seasonal changes in surface reflectance, and redefined debates over 
how climate controls forest productivity in the Amazon. Setting aside 
arguments that the remote sensing analysis by Morton et al. is faulty”, 
we take their correction’ at face value, and ask two questions. 

First, we ask whether the corrected results support their core 
conclusion that dry-season green up, previously observed by MODIS 
EVI, is eliminated. The hypothesis that Amazon forests green up 
in the dry season? can be rigorously evaluated by formal statistical 
tests. Morton et al.” showed that their correction reduces estimated 
dry season green up, AEVI (the EVI change during the dry season, 
AEVI = October EVI — June EVI; figure 3 in ref. 7 and Fig. 1). As the 
corrected mean AEVI was smaller than an a priori estimate of error for 
individual EVI observations, they concluded that the corrected mean 
AEVI was indistinguishable from zero. We find that this comparison, 
however, is not appropriate for assessing whether corrected EVI can 
resolve a basin-wide green up. The correct comparison, of mean 
AEVIto the error of the mean of the whole population of observations, 
is accomplished with standard statistical tests that lever the prob- 
ability theory ‘law of large numbers’!’. For example, the 95% confi- 
dence interval’? for basin-wide mean of corrected AEVI significantly 
excludes zero (Fig. 1). Alternatively, the corrected AEVI distribution’ 
can be compared to the binomial distribution generated by the null 
hypothesis that pixels are equally likely to exhibit positive or negative 
AEVI (Fig. 1), which is analogous to treating ‘green up’ or ‘brown down’ 
as the outcome of the flip of a fair coin. 

These standard tests show that corrected AEVI’, though substan- 
tially smaller in magnitude than uncorrected, nonetheless shows a 
highly significant increase in forest greenness. 

Second, we ask whether the smaller, but statistically significant, green 
up seen in the data from Morton et al. (Fig. 1) is biologically meaningful 
in terms of consistency with mechanisms and magnitude of seasonal 
changes in canopy-scale biophysics observed on the ground. We find 
that at an intensively measured site, significant dry-season increases 
in leaf area index are driven by coordinated flushing of new leaves, 
which have higher near-infrared reflectance (Fig. 2a) (mechanisms 
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Figure 1 | Distribution of uncorrected and corrected AEVI. 
Reproducing figure 3b of ref. 7, with 95% confidence interval (shaded blue 
region), significantly excluding zero. We conservatively assume that only 
relatively large areas (1° x 1°, or ~10* MODIS pixels) are independent, 
giving 158 independent 1° x 1° patches that include valid pixels, and 95% 
confidence interval: AEVI + ZSaARyj = 0.016 + 0.0027, where Z is 1.96 
(the 95% Z-score), and Sxgvy is the standard error of AEVI (derived from 
AEVI standard deviation as Sagyr / 158). The probability of observing 
84.2% positive values (or heads from fair coin flips) out of 158 
observations is P< 10~'5 (binomial test). 


that Morton et al.’ hypothesized could drive true increases in satellite- 
observed EVI). Leaf flushing is followed, after 1 to 2 months, by 
increases in photosynthetic capacity derived from CO, fluxes measured 
at eddy flux towers (Fig. 2a). This correlation—1-month-lagged photo- 
synthetic capacity with leaf area index, r=+0.90, and with MAIAC EVI, 
r=-+0.89, where r is Pearson’s correlation coefficient, and the time lag 
is for new leaves to develop their photosynthetic capacity't—establishes 
a link between eddy flux measurements and biophysical properties 
observable from satellites. 

On the basis of this link, we find that increases in dry-season 
greenness seen by corrected EVI products (whether those of ref. 7 or 
the MAIAC EVI of Lyapustin et al.'°; Fig. 2b) are real and consistently 
correlated with photosynthetic capacity increases seen at towers within 
the region analysed by Morton et al. (including adjustment for possible 
sun-angle effects on canopy illumination). This suggests that even the 
smaller corrected AEVI’ reflects mechanisms of canopy changes actu- 
ally observed on the ground, and is therefore biologically meaningful. 

The analysis in Morton et al.’ is, notably, stimulating a productive 
re-examination of the methodology, meaning and magnitude of remote 
sensing indices, their artefacts, and their relation to field studies on the 
ground®!*. However, we believe that the primary substantive finding 
of Morton et al. of consistent canopy structure and greenness is incor- 
rect. Both satellite remote sensing and ground-based observations show 
dry-season increases in greenness and biophysical properties associated 
with canopy photosynthesis across scales, from individual leaves to 
ecosystems to regions, in support of the conclusion that Amazon forests 
green up with sunlight in the dry season*"4. 
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Figure 2 | Seasonality of vegetation metrics. a, Corrected EVI (from 
extended data figure 7 of ref. 7; red ‘X’ in June and October), and average 
cycle of corrected MAIAC EVI, a product that also corrects sun-sensor 
geometry (to nadir view, and 45° sun angle; black line)"; leaf area index 
(dark green)'; percentage of trees with new leaves (light green)'°; and 
tower-derived photosynthetic capacity (PC/PCmax, blue line; see Methods), 
all from site K67 (ref. 4). BRDFE, bidirectional reflectance distribution 
function. b, Green up at four tower sites for (i) EVI (corrected as ina 
and uncorrected); (ii) MAIAC EVI (corrected and uncorrected); and 
(iii) tower-derived photosynthetic capacity (with and without fixed 

solar zenith angle (SZA), showing potential effects of changing solar 
illumination). Sites: K34 (Manaus), K67 and K83 (Santarém), and CAX 
(Caxiuana National Forest, near Belém (CAX had insufficient data for 
fixed SZA analysis))*. All uncertainties are 95% confidence intervals. 


Methods 

For basin-wide analysis, we analysed AEVI, corrected on a per-pixel basis, in 
197,651 valid pixels (from figure 3 of ref. 7, data courtesy of D. C. Morton). For 
tower comparisons, we averaged valid pixels from both extended data figure 7 of 
ref. 7 (corrected with a simplified approach that retrieved AEVI for more area, 
~2 x 10° pixels, including around towers), and from the independent sun-sensor 
geometry corrected MAIAC EVI product”, in 5km grids around towers (11 km 
around CAX, to obtain sufficient data; Fig. 2b). Tower-based 95% confidences are 
boot-strapped June-October changes in photosynthetic capacity. Photosynthetic 
capacity is eddy-flux-derived gross primary productivity (from ref. 4) averaged 
under reference environmental conditions, an estimate of photosynthetic 
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infrastructure independent of environment and (with additional binning) sun 
angle. Reference bins were: light (1,350 + 200 zmol m~’s~1), vapour pressure defi- 
cit (980 + 200 Pa), relative irradiance (observed/clear-sky expected = 0.6 + 0.1), 
and solar zenith angle (25°-35°). 


Scott R. Saleska}, Jin Wu?, Kaiyu Guan?, Alessandro C. Araujo, 
Alfredo Huete’, Antonio D. Nobre® & Natalia Restrepo-Coupe* 
lEcology and Evolutionary Biology, University of Arizona, Tucson, Arizona 
85721, USA. 

email: saleska@email.arizona.edu 

Environmental and Earth System Science, Stanford University, Stanford, 
California 94305, USA. 

3Embrapa Amazonia Oriental, Belem, Brazil. 

4Plant Functional Biology and Climate Change Cluster, University of 
Technology Sydney, Sydney, New South Wales 2007, Australia. 

5National Institute for Space Research (INPE) and National Institute for 
Amazonian Research (INPA), Sao José dos Campos, Brazil. 


Received 21 November 2014; accepted 15 October 2015. 


1. Borchert, R. Soil and stem water storage determine phenology and distribution 
of tropical dry forest trees. Ecology 75, 1437-1449 (1994). 
2. Wright, S. J. & Van Schaik, C. P. Light and the phenology of tropical trees. 
Am. Nat. 143, 192-199 (1994). 
3. Huete, A. R. et a/. Amazon rainforests green-up with sunlight in dry season. 
Geophys. Res. Lett. 33, L06405 (2006). 
4. Restrepo-Coupe, N. et a/. What drives the seasonality of productivity across the 
Amazon basin? A cross-site analysis of eddy flux tower measurements from 
the Brasil flux network. Agric. For. Meteorol. 182-183, 128-144 (2013). 


5. Shuttleworth, W. J. Evaporation from Amazonian rain forest. Proc. R. Soc. Lond. 
B Biol. Sci. 233, 321-346 (1988). 
6. Guan, K. et al. Photosynthetic seasonality of global tropical forests constrained 


ion 


y hydroclimate. Nature Geosci. 8, 284-289 (2015). 

de orton, D.C. et al. Amazon forests maintain consistent canopy structure and 

greenness during the dry season. Nature 506, 221-224 (2014). 

Jupp, D. L. B. & Strahler, A. H. A hotspot model for leaf canopies. Remote Sens. 

Environ. 38, 193-210 (1991). 

9. Myneni, R. et al. Global products of vegetation leaf area and fraction absorbed 

PAR from year one of MODIS data. Remote Sens. Environ. 83, 214-231 (2002). 

10. Lyapustin, A. |. et a/. Multi-angle implementation of atmospheric correction for 

ODIS (MAIAC): 3. Atmospheric correction. Remote Sens. Environ. 127, 

85-393 (2012). 

uete, A. et al. Overview of the radiometric and biophysical performance of the 

ODIS vegetation indices. Remote Sens. Environ. 83, 195-213 (2002). 

i, J. et al. Sunlight mediated seasonality in canopy structure and 

hotosynthetic activity of Amazonian rainforests. Environ. Res. Lett. 10, (2015). 

heskin, D. J. Handbook of Parametric and Nonparametric Statistical Procedures 

h edn (Chapman & Hall/CRC, 2007). 

14. Wu, J. et al. Leaf development and demography explain photosynthetic 
seasonality in Amazon evergreen forests. Science 351, 972-976 (2016). 

15. Brando, P. M. et al. Seasonal and interannual variability of climate and 
vegetation indices across the Amazon. Proc. Nat! Acad. Sci. USA 107, 
14685-14690 (2010). 


w 


MND W ot 


Author Contributions S.R.S. designed the statistical analysis and wrote the 
initial draft of the paper; J.W. implemented the statistical analysis; K.G. provided 
MAIAC EVI data and analysis; A.C.A. contributed eddy flux data; A.H. contributed 
analysed EVI data; A.D.N. contributed eddy flux data and insights; and N.R.C. 
integrated the multi-tower datasets. All authors contributed to writing the final 
version. 


Competing Financial Interests Declared none. 


doi:10.1038/naturel16457 


© 2016 Macmillan Publishers Limited. All rights reserved 


BRIEF COMMUNICATIONS ARISING 


Morton et al. reply 


REPLYING TO S. R. Saleska et a/. Nature 531, http://dx.doi.org/10.1038/nature16457 (2016) 


Multiple mechanisms could lead to upregulation of dry-season 
photosynthesis in Amazon forests, including canopy phenology and 
illumination geometry. We specifically tested two mechanisms for 
phenology-driven changes in Amazon forests during dry-season 
months, and the combined evidence from passive optical and lidar 
satellite data’ was incompatible with large net changes in canopy leaf 
area or leaf reflectance suggested by previous studies”. We therefore 
hypothesized’ that seasonal changes in the fraction of sunlit and shaded 
canopies, one aspect of bidirectional reflectance effects in Moderate 
Resolution Imaging Spectroradiometer (MODIS) data, could alter 
light availability for dry-season photosynthesis and the photosynthetic 
capacity of Amazon forests without large net changes in canopy 
composition. Subsequent work supports the hypothesis that seasonal 
changes in illumination geometry and diffuse light regulate light 
saturation in Amazon forests®’. These studies clarify the physical mech- 
anisms that govern light availability in Amazon forests from seasonal 
variability in direct and diffuse illumination. Previously, in the debate 
over light limitation of Amazon forest productivity, seasonal changes in 
the distribution of light within complex Amazon forest canopies were 
confounded with dry-season increases in total incoming photosyntheti- 
cally active radiation”**. In the accompanying Comment’, Saleska et al. 
do not fully account for this confounding effect of forest structure on 
photosynthetic capacity. 

Saleska et al.? investigated one of the three lines of evidence in our 
paper to argue that near-zero seasonal changes in corrected MODIS 
enhanced vegetation index (EVI) are actually non-zero (figure 1 in 
ref. 9; 0.071 to 0.016, a 77% reduction). Following this logic, our data 
also show a small but statistically significant decrease in normalized 
difference vegetation index (NDVI; extended data figure 4 in ref. 1), 
a pattern that we attributed to residual artefacts from changes in 
sun-sensor geometry, as no leaf-level mechanism for increased forest 
productivity generates opposing responses in these vegetation indices 
(see supplementary discussion in ref. 1). Indeed, the comparison 
between NDVI and EVI responses is a useful diagnostic tool’ that could 
have been used to investigate residual bidirectional reflectance effects in 
multiangle implementation of atmospheric correction (MAIAC) data 
(figure 2 in ref. 9). 

In isolation, MODIS data provide limited insight into the mecha- 
nisms for seasonal changes in Amazon forests’. MODIS EVI is pri- 
marily sensitive to changes in near-infrared reflectance!*'°, not 
photosynthetically active radiation absorption that drives forest 
productivity. Saleska et al. misrepresent data from extended data 
figure 7 of ref. 1 as fully corrected in their figure 2 (ref. 9), and further 
confound seasonal changes through spatial averaging of 1 km? data 
over large regions (25-121 km?). A previous study using 1 km? data for 
these same tower sites shows little or no seasonality in MAIAC EVI"! 
(see supplementary figure 5 in ref. 1). 

One of the key messages from our study was the need for careful 
attention to uncertainty in satellite-based measurements of forest sea- 
sonality. The presentation of in situ and satellite data by Saleska et al. 
(figure 2a in ref. 9), and the MAIAC product in general, could be 
improved with quantitative estimates of uncertainty to support 
assertions of forest seasonality. 


Subtle variability in canopy structure and reflectance properties 
of Amazon forests remains a key area for further study, particularly 
with large-scale field studies!’, to better understand the spatial and 
temporal heterogeneity of leaf phenology strategies in Amazonia”. 
Other mechanisms for seasonal changes in photosynthetic capacity 
also merit further investigation, including how diurnal and seasonal 
variability in illumination alter the distribution of photosynthetically 
active radiation at the leaf level*”'4. NASA satellite data remain an 
important foundation for future research on tropical forest dynamics, 
within the limits of calibration, measurement, and model uncertainty 
that can be realistically achieved with space-based sensors. 
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MATERIALS SCIENCE 


How crystals get an edge 


Microscopy reveals how nanowires of a widely used semiconductor grow during preparation. The findings will allow the 
crystal phases of such nanowires to be engineered — a crucial advance for materials science. SEE ARTICLE P.317 


ANNA FONTCUBERTA | MORRAL 


he twinkling facets of single crystals 
To mineral collections have inspired 

breathtaking artwork and magnifi- 
cent buildings, such as the Sagrada Familia 
in the Catalan city of Barcelona, Spain. Yet 
our understanding of the nucleation (the 
formation of tiny crystal seeds at the start of 
crystallization) and growth of even the small- 
est filamentary crystals, including nanowires, 
is fragmentary. This slows the implementa- 
tion of nanowire-based technology in many 
areas of science, including electronics, pho- 
tonics and quantum information. On page 
317 of this issue, Jacobsson and colleagues' 
show how to control the crystal form adopted 
by nanowires at nucleation, thereby allow- 
ing the structure to be tailored for particu- 
lar functions, and solving one of the most 
fundamental challenges in materials science 
and engineering. 

Polytypism is the ability of a compound to 
exist as various crystal forms that differ only in 
terms of the sequence in which ‘bilayer struc- 
tures’ of atoms stack in a given direction within 
a crystal lattice. Bilayer structures are arrange- 
ments of atom pairs connected by bonds, and 
can be thought of as subunits from which all the 
possible crystal types of a compound can be 
constructed. An archetype of polytypism in 
bulk crystals is that of silicon carbide (SiC), for 
which three bilayer structures (defined as A, B 
and C) are needed. Silicon carbide has more 
than 200 polytypes, corresponding to differ- 
ent combinations of the bilayers’. For cubic 
crystal systems, including the cubic struc- 
tures adopted by silicon carbide and gallium 
arsenide (GaAs, a widely used semiconductor), 
the most common polytypes correspond to the 
zinc-blende (ABCABC) and wurtzite (ABAB) 
crystal structures. 

The polytype formed is determined at 
the nanometre scale during nucleation. At 
this scale, the high surface-to-volume ratio 
of crystals can stabilize polytypes and specific 
crystal phases that are unstable in the bulk 
form. The formation of polytypes in nanowires 
is especially intriguing. Nanowires are often 
made using the vapour-liquid-solid (VLS) 
method, in which a liquid-metal droplet acts 
as a catalyst that gathers gaseous precursors 


a 
Droplet 
Newly 
forming 
layer 
Nanowire 


Figure 1 | Nanowire growth. In the vapour-liquid-solid method, a liquid-metal droplet acts as a catalyst 
that transforms gaseous precursors (not shown) into a crystalline solid that grows as a nanowire beneath 
the droplet. Different crystal phases can grow, consisting of different sequences of layers that contain 
distinct structural arrangements of atoms; layer types are denoted A, B and C. a, Jacobsson et al.' observed 
that, when the contact angle of the droplet to the nanowire is close to 90°, the liquid—solid interface is flat, 
and favours ABAB stacking (the wurtzite crystal phase). b, Contact angles larger or smaller than 90° result 
in the formation ofa crystal edge that starts at the triple-phase line (TPL, which separates the vapour, 
liquid and solid phases), and favours the ABCABC stacking of the zinc-blende phase. 


and transforms them into a solid that grows 
beneath the droplet (Fig. 1). The growth con- 
ditions determine which polytypes form for a 
given diameter of the nanowire. 

Several theories*° have addressed the 
question of which polytype forms during VLS 
nanowire formation, and have proposed that 
the triple-phase line (TPL, also known as the 
trijunction) — which separates the vapour, liq- 
uid and solid phases in VLS processes — has 
a decisive role. According to these theories, 
nucleation at the TPL favours formation of a 
wurtzite phase, whereas nucleation outside 
the TPL favours the zinc-blende phase. Con- 
trol of whether nucleation occurs at or outside 
the TPL might be achieved by modifying the 
contact angle between the VLS droplet and the 
nanowire, but explicit experimental evidence 
to support this hypothesis has been lacking. 
This was particularly true for the technologi- 
cally interesting family of arsenide and phos- 
phide semiconductors. 

Jacobsson et al. provide the first experi- 
mental evidence of the role of the TPL and the 
contact angle in polytype formation, going 
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beyond simple growth conditions. By watch- 
ing GaAs nanowires grow using a transmission 
electron microscope, they have determined the 
parameters that lead to wurtzite or zinc-blende 
structures. More specifically, they observed the 
changes of crystal phase that occur alongside 
variations in the flux of gaseous GaAs precur- 
sors (trimethylgallium and arsine). 

The authors began by observing nanowire 
growth using VLS conditions that lead to 
formation of the wurtzite phase. They found 
that each new bilayer formed at the TPL, 
creating a complete, flat layer across the top of 
the crystal (Fig. 1a). The liquid droplet became 
larger and richer in gallium as the flux of arsine 
was reduced. 

When the droplet reached a certain size, 
Jacobsson and colleagues observed the sudden 
appearance of an edge — a facet of the nano- 
wire that starts at the TPL within the liquid- 
solid interface (Fig. 1b). The growth dynamics 
of the nanowire then changed drastically. New 
bilayers formed much faster than before, and 
each was accompanied by oscillation of the 
edge — that is, the position of the edge moved 


outwards from the bilayer and then jumped 
back again. Once an edge appeared, further 
growth on the nanowire quickly adopted 
the zinc-blende structure. Previously grown 
wurtzite segments partially dissolved at the 
edge, but regrew as wurtzite. 

Jacobsson et al. propose that the size of 
the droplet determines both its contact angle 
with the nanowire and the morphology of the 
liquid-solid interface. They also suggest that 
contact angles of about 90° are compatible with 
a flat liquid-solid interface and with nuclea- 
tion of bilayers at the TPL. By contrast, smaller 
or larger contact angles lead to the formation of 
an edge at the liquid-solid interface, starting at 
the TPL — which suppresses nucleation of new 
bilayers at the TPL, enabling the zinc-blende 
structure to form. 

These results extend our knowledge far 
beyond the orthodox understanding of poly- 
type formation, and provide a path towards 
crystal-phase design: it should now be possible 
to engineer VLS systems to select the crystal 
phase of GaAs that forms. But can polytypism 
be tailored for different VLS catalysts, or even 
in their absence? The answer could be crucial 
to avoiding the unintentional incorporation of 
impurities from catalysts into nanowires. 

The functional behaviour of classical semi- 
conductor heterostructures® (combinations 
of two different semiconductors) is controlled 
by the change of composition that occurs at 
the interface between the component mater- 
ials — the interface modifies the potential- 
energy landscape of charge carriers such as 
negatively charged electrons and positively 
charged ‘holes’ In his physics Nobel lecture, 
Herbert Kroemer introduced the concept 
of engineering heterostructures to ‘teach 
electrons new tricks’ in electronic and opto- 
electronic applications®. The unintentional 
formation of different polytypes in hetero- 
structures is detrimental to such applications. 
But Jacobsson and co-workers findings about 
how to make polytypes reproducibly mean 
that we can add crystal-phase engineering to 
the list of techniques by which we teach new 
tricks to electrons, holes and their associated 
spins’, and might enable the development of 
new semiconductor applications. m 
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Visionary stem-cell 


therapies 


Stem-cell engineering has allowed successful cornea transplantations in rabbits 
and the regeneration of transparent lens tissue in children, demonstrating the 
therapeutic potential of this approach. SEE ARTICLE P.323 & LETTER P.376 


JULIE T. DANIELS 


o correctly refract light onto the retina at 

the back of the eye, the cornea and lens 

must remain transparent throughout 
life. Treatments for opacification of the cornea 
or lens involve donor transplants or artificial 
implants, respectively, but these procedures 
can be risky. Alternative strategies for treating 
such ocular disorders would be to transplant 
tissue grown in the laboratory from stem cells, 
or to coax resident stem cells to regenerate 
normal tissue in the body. Two papers in this 
issue, by Hayashi et al.' (page 376) and Lin 
et al.’ (page 323), report work that advances 
these possibilities. 

Blindness represents a great clinical and 
economic burden worldwide. Corneal trans- 
plantation has been the gold standard for 
restoring transparency since the first suc- 
cessful transplant in 1905 (ref. 3). However, 
despite the fact that corneal transplants are 
less likely to induce an immune response 


a In vitro 


than transplants to other sites in the body, 
grafts can be rejected by the host body within 
five years’, 

One promising strategy for avoiding 
rejection involves tissue engineering using the 
patient’s own (autologous) cells. This approach 
has proved successful in treating people with 
ocular chemical burns, in whom autologous 
cells called limbal stem cells were used to 
replace the epithelial cells that make up the 
outermost layer of the cornea’. However, 
it is not always feasible to harvest sufficient 
stem cells to grow in the laboratory. As an 
alternative, the conversion of adult cells into 
induced pluripotent stem cells (iPSCs), which 
can develop into any cell type’, could supply 
enough cells for therapy. 

During embryonic development, ocular tis- 
sue is formed from three tissue layers, one of 
which, the surface ectoderm, gives rise to the 
corneal epithelium and lens. Hayashi et al.’ 
grew human iPSCs in vitro under conditions 
that promoted the creation of a structure 


Isolation of 
epithelial 
stem cells 


Clear corneal sheet 


Lens 
removal 


Regeneration 


Figure 1 | Sight for sore eyes. Opacity in the cornea or the lens of the eye can cause blindness, and 
current treatment strategies are less than ideal. a, To produce corneal sheets that could be transplanted 
into rabbits, Hayashi et al.' cultured induced pluripotent stem cells (iPSCs, which can give rise to all body 
cell types) in conditions that promoted the development of a SEAM — a structure containing four distinct 
zones and resembling the embryonic eye. Isolation and further culture of epithelial stem cells from zone 3 
of the SEAM successfully produced corneal sheets for transplantation. b, Opaque lenses are normally 
removed and replaced with artificial implants. Lin et al.” devised a minimally invasive method for lens 
removal, which left lens epithelial stem cells (LECs) intact. These LECs regenerated the lens in rabbits, 
macaques and children, eliminating the need for implants. 
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that they dubbed the self-formed ectodermal 
autonomous multi-zone (SEAM), which 
contained four defined concentric zones 
that in some ways mimicked the developing 
eye. The authors found that different SEAM 
zones contained cells with characteristics of the 
ocular surface ectoderm, the lens, the neuro- 
retina and the retinal pigment epithelium. 

Blocking BMP signalling — an intracellular 
pathway required for the development of 
surface ectoderm cells — abolished zone 3 of 
the SEAM. Hayashi and colleagues tested the 
therapeutic potential of cells from this zone 
by harvesting the cells and selecting those 
that expressed genes characteristic of epithe- 
lial stem cells (Fig. 1a). The authors cultured 
transplantable sheets of corneal epithelium 
from the selected cells, and demonstrated 
that these sheets could restore a healthy cor- 
neal epithelium in rabbits in which corneal 
epithelial stem cells had been experimentally 
depleted. 

Cataracts, which cause sight-threatening lens 
clouding’, are surgically treated by removing 
the lens from its supporting capsule and 
replacing it with an artificial intraocular lens 
(IOL). In children with congenital cataracts, a 
major cause of childhood blindness, the suc- 
cess of IOL implantation is limited”"° — sur- 
gery can cause opacity in the line of vision and, 
because the eye is still growing, it is difficult 
to provide good vision with spectacles. Rather 
than attempting to create a living lens in vitro, 
Lin et al.’ investigated the possibility of regen- 
erating a naturally transparent lens in the body. 

The authors discovered that lens epithelial 
stem/progenitor cells (LECs) expressing the 
genes PAX6 and SOX2 self-renew and dif- 
ferentiate into lens fibre cells that can form a 
3D transparent lens-like structure that refracts 
light. In mice, mutation of the stem-cell- 
maintenance gene Bmi-1, which is expressed in 
LECs, impaired LEC proliferation and induced 
cataract formation. These data led Lin et al. to 
reason that, by refining the technique for surgi- 
cal cataract removal to minimize the damage 
done to LECs in situ, they could promote lens 
regeneration (Fig. 1b). 

In rabbits, the authors’ minimally invasive 
technique led to lens regeneration around 
seven weeks after surgery. The approach 
achieved similar results in macaques, in which 
lens regeneration took several months and no 
complications arose. Finally, the researchers 
performed a clinical trial, in which transpar- 
ent lenses were regenerated in both eyes of 
12 infants within 3 months, all without 
complication. 

These two studies illustrate the remarkable 
regenerative and therapeutic potential of 
stem cells. Hayashi and colleagues’ approach 
involved substantial in vitro cell manipulation 
to obtain a sheet of cultured corneal epithelium 
for transplantation. When considering the 
expense involved in following good manufac- 
turing practices for cell therapies, the current 


protocol is unlikely to be economically viable. 
However, the real value of this research lies in 
the possibility that the SEAM model will facili- 
tate the discovery of fundamental mechanisms 
that underlie the early development of each 
type of ocular tissue. Such an understanding 
might eventually enable in situ manipulation 
of stem-cell populations throughout the eye, as 
Lin et al. have elegantly shown to be achievable 
for the lens. Furthermore, lens regeneration 
might also turn out to be possible in ageing 
adults in whom LEC proliferation has 
declined — for example, research on the 
SEAM could identify small molecules able to 
stimulate such regeneration. 

Whether either of the reported therapies 
will lead to cornea or lens transparency that 
can be maintained in the long term remains 
uncertain. However, these exciting studies take 
us away from simple therapies that involve 
like-for-like replacement of single mature cell 
types, and open up the possibility of thera- 


GLOBAL WARMING 


peutic manipulation of the broader stem-cell 
environment in the eye. m 
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China’s contribution 
to climate change 


Carbon dioxide emissions from fossil-fuel use in China have grown dramatically 
in the past few decades, yet it emerges that the country’s relative contribution to 
global climate change has remained surprisingly constant. SEE LETTER P.357 


DOMINICK V. SPRACKLEN 


to limit the increase in global average 
temperature to less than 2 °C above pre- 
industrial temperatures (see Nature 528, 
315-316; 2015). Meeting this aspiration will 
require large and rapid reductions in green- 
house-gas emissions, making it imperative 
to understand and account for the emissions 
from different countries. China has undergone 
rapid economic development over the past few 
decades and now has one of the world’s largest 
economies — and greenhouse-gas emissions 
to match. On page 357 of this issue, Li et al.’ 
comprehensively assess China's contribution 
to climate change and explore how this has 
altered as the Chinese economy has grown. 
Humans affect Earth’s climate through 
many mechanisms by changing the abun- 
dance of greenhouse gases and air pollutants, 
and by altering the reflectivity of Earth’s sur- 
face through changes in land use. The rela- 
tive strengths of these different drivers can be 
compared through a metric known as radiative 
forcing, which quantifies the impact of each 
process on Earth's energy budget. 
Li et al. used a model that couples 


I: December 2015, world leaders agreed 
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biogeochemistry and climate to estimate 
China’s contribution to global radiative forcing 
over the period 1980-2010. Crucially, they 
account for almost all anthropogenic drivers 
of climate change. They find that China’s rela- 
tive contribution to global radiative forcing 
from carbon dioxide emissions associated with 
fossil-fuel use increased almost threefold in 
these 30 years. This is to be expected, given 
the surge in China’s economy over this period. 
More surprisingly, they find that China’s 
relative contribution to total global radiative 
forcing has remained at 10% over this time. 

To understand the reasons behind this 
remarkable result, Li and colleagues made a 
detailed analysis of the different drivers of radi- 
ative forcing. They found that the air pollutants 
that cause China’s notorious pollution haze 
have had complex effects on climate, counter- 
acting some of the increase in radiative forcing 
from greenhouse gases. Some components of 
air pollution, such as black-carbon particles, 
absorb sunlight and warm Earth’s climate. By 
contrast, sulfate particles scatter light, resulting 
in climate cooling. 

Over the past few decades, China’s rela- 
tive contribution to global radiative forcing 
from sulfate has increased dramatically. This 
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Figure 1 | No net change. Lief al.' report that China's relative contribution to global radiative forcing 

—a measure of how strongly different factors affect climate change — has remained constant over the past 
three decades (broken blue line). The total net effect has several contributors. Carbon dioxide emissions 
from fossil-fuel combustion have increased, as have black-carbon emissions, both of which lead to climate 
warming (positive radiative forcing). Land-use change can also contribute to warming, but the effects 

of this have declined. Conversely, sulfate emissions that cool climate have increased, and the negative 
radiative forcing associated with this has offset that from warming factors. Plotted lines are approximate. 


is because Chinese sulfate emissions soared 
at the same time that Europe and the United 
States instigated controls that slashed their 
sulfate emissions. It has long been known that 
some air pollutants cool the climate’; what 
is remarkable in the present study is that the 
concurrent changes in different emissions have 
led to a stable overall contribution of China to 
global radiative forcing (Fig. 1). 

Air pollution is a serious environmental 
issue in China, where 1.3 million people die 
each year because of exposure to poor-quality 
air outdoors’. Reductions in the emissions of 
air pollutants are urgently required to improve 
air quality, but this will also affect Earth’s 
climate. Li et al. find that the current composi- 
tion of Chinese air pollution causes almost no 
net radiative forcing — the cooling effects of 
sulfate aerosols balance the warming impacts 
of black-carbon emissions. 

This means that it will be difficult to achieve 
rapid reductions in near-term global warming 
through the control of Chinese air pollutants 
overall — a focus on greenhouse-gas emissions 
in particular will be required. It also means that 
carefully managed mitigation of air pollution 
that focuses on reducing both black-carbon 
and sulfate emissions might have a minimal 
impact on climate, because their effects seem 
to counteract each other. Controlling the com- 
bustion of solid fuels for cooking and heat- 
ing in the home is important in this context, 
because domestic solid fuel accounts for 40% 
of Chinese black-carbon emissions’ and causes 
half a million deaths annually through poor 
outdoor air quality**. 


Liand co-workers went on to explore China’s 
contributions to emissions of CO, and methane 
from pre-industrial times (1750) to the present 
day. They find that China's relative contribu- 
tion to radiative forcing from these greenhouse 
gases has remained remarkably constant over 
this much longer period as well. The exten- 
sive conversion of China’s natural forests to 
agricultural land resulted in substantial CO, 
emissions in the early part of this period. The 
rate of deforestation has declined in recent 
decades, but this has been counteracted by 
increasing fossil-fuel emissions. China is now 
planting forests ona larger scale than anywhere 
else on the planet. These plantations seques- 
ter CO, from the atmosphere, so that Chinese 
forests are now a net sink of this gas. 

Mitigating climate change and air quality 
without unintended consequences will require 
an understanding of many complex inter- 
actions. Current models, including the one 
used by Li et al., do not cover many of these 
complexities. In particular, the authors’ study 
does not consider the formation of secondary 
organic aerosols — which might dominate 
in the haze over China* — from gaseous pol- 
lutants. Detailed monitoring of Chinese air 
pollution is urgently needed to inform the 
development of effective mitigation policies®. 

Air pollutants also interact in complex ways 
with ecosystems: land-use change alters air 
quality’, and deposition of pollution can alter 
forest growth and carbon sequestration®. But 
these effects are not included in many mod- 
els. Recent work’ has shown that fast-growing 
forest plantations in Europe store less biomass 


© 2016 Macmillan Publishers Limited. All rights reserved 


NEWS & VIEWS | RESEARCH | 


50 Years Ago 


The winds of change are sometimes 
almost indistinguishable from placid 
summer breezes. The decision of 
the British Government that British 
money must now be decimal... 
would bring some benefit, not 
disaster ... There will be those who 
claim that the duodecimal system 

is better because twelve has several 
integral factors, though it is at least 
as sensible to argue that the base 
ofall arithmetic should be a prime 
number in order that people should 
not be encouraged to manipulate 
vulgar fractions ... More distantly, 
other feats of rationalization may 
now be attempted. Why not, for 
example, decimalize the day? ... A 
decimalized day should be a much 
more practical proposition. From 
midnight to midnight would bea 
million new seconds. One per cent 
ofa day, or 10* new seconds, would 
be a convenient sub-unit roughly 
equal to a quarter of what is now 

an hour. Astronomers and airline 
travellers alike would welcome — in 
due course — that further proof that 
decimals are not merely reasonable 
but inevitable. 

From Nature 19 March 1966 


100 Years Ago 


Scientific men in their most 

august society are banded 

together “for the improvement 

of natural knowledge.” They are 

by implication a body of students 
working in the temple of Nature for 
truth’s sake alone, heedless of the 
world and its rewards. What they 
garner is their gift to the world: they 
fill another page in the Revelation 
that brings men nearer to the 
angels. Let a man wander into the 
world with his science as wares to 
sell for money profit, and he has 
passed from true brotherhood. 
Surely this idea, perhaps rather 
fancifully stated, is at the bottom of 
much of our exclusiveness. 

From Nature 16 March 1916 
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and absorb more sunlight than do natural 
forests; both of these features reduce the forests’ 
benefit to the climate. Whether similar issues 
are at play across China requires investigation, 
but it is possible that a greater focus than at 
present on protecting and restoring natural 
forests in China might also provide greater 
benefits for global climate. = 
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Swimmers by design 


Scientists have created soft microrobots whose body shapes can be controlled 
by structured light, and which self-propel by means of travelling- wave body 
deformations similar to those exhibited by swimming protozoa. 


IGOR S. ARANSON 


warms of ‘smart’ microrobots scouting 
the human body, delivering medicines or 
assembling complex micromachines have 
long been a popular theme in blockbuster films 
and best-selling novels. Take, for example, the 
1966 film Fantastic Voyage, or Michael Crich- 
ton’s 2002 novel Prey. Although at present the 
concept remains in the realm of science fiction, 
researchers are taking strides towards bringing 
the vision closer to reality. Writing in Nature 
Materials, Palagi and colleagues’ report a big 
advance towards this goal: the creation of a 
synthetic light-powered microrobot inspired 
by swimming Paramecium protozoa. 
Designing a robust microscopic robotic 
swimmer that can navigate complex environ- 
ments and perform useful functions is a key 
component of the quest. To operate autono- 
mously or on demand, a microswimmer 
should be able to harvest 
energy, propel itself through 
fluid towards its target and 
respond to external signals. 
Energy is needed both to over- 
come the friction of the fluid 
and to maintain motion for a 
long time — up to an hour for 
some biomedical applications. 
Several designs exist. One 
is a microscopic gold-plati- 
num rod that self-propels in 
an aqueous solution of hydro- 
gen peroxide (H,O,; ref. 2). 
The microrod decomposes 
H,O, and transfers the energy 
released by this process to the 
water. Ensuing water flow 
pushes the rod forward like a 
miniature submarine. A simi- 
lar design** sees a microscopic, 
rocket-shaped, composite 
platinum-polymer tube gen- 
erate gas bubbles from the 
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decomposition of H,O,, with the detaching 
bubbles propelling the microrocket. 

External magnetic and electric fields can 
also be used to power microswimmers” ’. 
Some swimmers move like snakes or worms, 
gliding on the surface of water by periodi- 
cally bending their bodies*’. Scientists fore- 
see microswimmers being used to unclog 
arteries”°, and to deliver immobile sperm to 
fertilize eggs". 

Nature has mastered highly effective means 
of micrometre-scale propulsion, exemplified 
by the rotation of helical bacterial flagella, 
and the wavy beating of the cilia (tiny hair- 
like structures) that cover Paramecium. This 
metachronal wave — the sequential movement 
of thousands of cilia — enables paramecia to 
swim at astounding speeds”, up to ten body 
lengths per second. (For comparison, a dol- 
phin barely makes two to three body lengths 
per second when in a hurry.) 


Figure 1 | Light-powered swimmer. A microscope (top) projects a moving sequence 
of light (green) and dark (black) stripes onto a photoresponsive, millimetre-long 
polymer rod, inducing periodic deformations (visible as bumps) on the rod’s surface. 
Palagi et al.' show that these travelling deformations propel the soft rod through a 
fluid in a manner that mimics the locomotion of protozoa. 
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Synthetic microswimmers exhibiting this 
amazing wavy propulsion would be highly 
desirable. However, implementing the 
coordinated wavy deformation of the micro- 
scopic swimmer’s body presents a technical 
challenge: the need to manufacture myriad 
minuscule actuators and joints that can be 
individually controlled. 

Palagi and colleagues propose an elegant 
means of microscopic propulsion afforded 
by travelling waves. Instead of a cumbersome 
array of addressable actuators, the authors use 
a synthetic polymer — a liquid-crystal elasto- 
mer. This rubbery material, made up of mol- 
ecules oriented in a certain direction, exhibits 
remarkably strong coupling between molecule 
orientation and mechanical deformation. The 
liquid-crystal elastomer elongates when the 
molecules are fully aligned with each other 
and shrinks when this molecular ordering is 
lost, typically when it is heated or exposed to 
intense light. As a result, this material can be 
highly sensitive to external stimuli such as light 
and heat”’, 

To make a swimmer, the authors illuminated 
a millimetre-long rod of this photoresponsive 
material with a laser beam, using a rectangu- 
lar array of microscopic computer-controlled 
mirrors to project a moving sequence of light 
and dark stripes onto the rod (see Fig. 1). The 
material responded by expanding or shrink- 
ing. Thus, a moving sequence of projected 
light and dark stripes created a 
pattern of travelling bumps 
along the rod, resembling the 
wavy beating of cilia. Remark- 
ably, the rod swam. 

Palagi et al. could tune the 
speed of their soft microrobots 
by adjusting the speed at which 
the projected stripes moved, 
and the motion could be 
started or stopped by turning 
the light on or off, respectively. 
By changing the light patterns, 
it was also possible to control 
several microswimmers simul- 
taneously and coerce them into 
rotating or travelling along a 
designated path. Theoretically, 
their swimming speed should 
increase proportionally with 
the speed at which the light 
stripes move. However, the 
authors found that the mater- 
ial’s time response limits the 
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maximum speed to about 40 micrometres 
per second (or, in terms of body length, about 
30 times slower than a dolphin). 

This work demonstrates a first step towards 
truly bioinspired self-propulsion. However, 
any practical applications would require opti- 
cal access. To make these robots more com- 
petitive with other synthetic swimmers, their 
size needs to be reduced to the micrometre 
scale and their relative swimming speed greatly 
increased. There are no serious technical limi- 
tations on manufacturing smaller and smaller 
polymer rods, but increasing the swimming 
speed is a different issue that comes down to 
material performance. 

Palagi and colleagues’ calculations suggest 
that the swimming speed would remain prac- 
tically unchanged if the swimmers’ size was 
reduced. If so, then a 5-um-long rod (about 


The X-Files 


a 200-fold reduction in size) would still swim 
at about 2-3 um per second (comparable to 
half a body length per second) — slower than 
bacteria and paramecia, but on a par with the 
speeds achieved by other synthetic swimmers 
of similar size’. Advances in high-performance 
photoresponsive materials will be needed to 
further boost the microrobots’ swimming 
speed. = 
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of hepatitis B 


The HBx protein of hepatitis B virus has been found to co-opt a host-cell enzyme 
that targets the Smc5/6 protein complex for degradation. The finding identifies 
Smc5/6 as acellular antiviral factor. SEE LETTER P.386 


T. JAKE LIANG 


in the science-fiction television show 

The X-Files, the hepatitis B virus research 
community has for decades been chasing its 
own unsolved mystery — the truth about the 
enigmatic ‘X’ protein of this virus. The HBx 
protein was first shown to be involved in 
transcriptional activation, and has since been 
implicated in diverse cellular pathways, includ- 
ing signal transduction, apoptotic cell death, 
cell-cycle regulation and DNA repair. But how 
HBx exerts its effects has remained unclear. In 
this issue, Decorsiére et al.’ (page 386) provide 
intriguing evidence that HBx mediates the 
degradation ofa host antiviral (restriction) fac- 
tor by interacting with the ubiquitin—protea- 
some system — the major protein-degradation 
system in cells. 

Hepatitis B virus (HBV) is a small DNA 
virus that has a partially double-stranded 
genome and replicates through an RNA 
intermediate. After entry into a host cell, the 
genome is converted to covalently closed cir- 
cular DNA (cccDNA) that exists as a mini- 
chromosome in the nucleus and serves as the 
template for viral gene transcription. Viruses 
that are related to the HBV that infects humans 
have been discovered in many species, includ- 
ing ducks, woodchucks, squirrels and diverse 


L: special agents Mulder and Scully 


species of bat. The virus predominantly infects 
liver cells, and can cause chronic infection even 
in the presence of an intact immune response. 
Much of the uncertainty about the function 
of HBx has been attributed to the limitations of 
the experimental models in which it has been 
studied, because most are based on non-infec- 
tious systems. However, it is clear that HBx is 
required for effective HBV infection in vivo; 
woodchuck hepatitis 

virus that harbours 


The discovery defects in the gene 
of Smc5/6asa encoding this protein 
viralrestriction _ is poorly infective’’. 

factor adds Many host factors 
to alist of are known to interact 
cellular-defence with HBx. Among 
mechanisms them, damaged 
against DNA- DNA binding pro- 
viralpathogens. ein 1 (DDB1) was 


first identified using 
a genetic approach’. 
This interaction was subsequently validated by 
structural and functional studies. But how this 
seemingly unrelated DNA-damage-response 
protein is involved in the function of HBx 
remained unclear. Like many typical break- 
throughs in science, advances in an unrelated 
field provided the connection. 
The ubiquitin-proteasome system has 
garnered much attention because of its 
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central role in many cellular processes” , and its 
components and biochemical pathways are 
well understood. Among them is an E3 ligase 
enzyme named CRL4. This member of the 
Cullin-RING ubiquitin ligase family uses 
DDB1 as an adaptor protein with which to 
target specific proteins for degradation. Sev- 
eral viral-gene products are known to target 
the CRL4-DDB1 complex’, suggesting that 
the ubiquitin—proteasome system may be a 
common cellular pathway exploited by viruses 
to ensure productive infection. Decorsiére 
et al. set out to identify proteins targeted for 
destruction by the complex formed between 
CRL4, DDB1 and HBx. 

Using a clever protein-interaction approach, 
the authors identified the Smc5/6 protein 
complex, which is involved in several aspects 
of chromosome biology, as one such target 
(Fig. 1). HBx redirects the enzymatic function 
of CRL4 to target Smc5/6 for ubiquitina- 
tion — a modification that marks the protein 
for degradation. They then used genetic 
and biochemical methods to show that the 
Smc5/6 complex indeed associates with 
the HBV genome (probably with the cccDNA, 
although this is not shown definitively) to 
inhibit viral transcription. A previous study’ 
of Cullin-RING ligases did not identify the 
Smc5/6 complex as a target of CRL4, so it is 
unclear whether the complex is a natural sub- 
strate of CRL4. It is possible that HBx alters 
the substrate specificity of CRL4 such that the 
enzyme targets cellular proteins, in addition to 
the Smc5/6 complex, for degradation. 

The discovery of Smc5/6 as a viral restric- 
tion factor adds to a growing list of intrinsic 
mechanisms in the cellular defence arsenal 
against DNA-viral pathogens*”. It seems that 
the Smc5/6 complex binds to and sup- 
presses only extrachromosomal (episomal), 
not chromosomally integrated, HBV DNA. 
This episome-specific function is reminis- 
cent of another class of antiviral factor, the 
APOBEC family, which specifically targets 
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Figure 1 | Hepatitis B virus evasion of cellular antiviral function. After a hepatitis B virus (HBV) 
particle has entered a host cell (during which it becomes de-coated), its genome is converted to covalently 
closed circular DNA (cccDNA) that exists as a mini-chromosome in the nucleus and serves as the template 
for viral gene transcription. Three HBV proteins have well-defined functions: a core protein (named 
HBcAg, or HBeAg when secreted), a reverse transcriptase enzyme (Pol) and an envelope protein (HBsAg). 
Decorsiére et al.’ reveal that another viral protein, HBx, acts to degrade a cellular antiviral factor, the 

Smc5/6 protein complex. The authors show that Smc5/6 probably binds to the HBV cccDNA and thus 
inhibits viral transcription. But HBx interacts with DDB1, an adaptor protein for the cell’s CRL4 E3 ubiquitin 
ligase enzyme complex, and this results in the Smc5/6 complex being targeted for ubiquitination — a 
modification that designates the complex for degradation by the cell’s proteasome machinery. 


episomal foreign DNA for modification and 
degradation”. It has been suggested that the 
APOBEC3A protein, whose expression is 
induced by interferon signalling proteins, 
binds to and edits HBV cccDNA, leading to 
cccDNA degradation". Such pathways reveal 
that diverse cellular mechanisms have evolved 
to defend against HBV infection. 

The Smc5/6 complex has been implicated in 
cell-cycle progression, chromosome organi- 
zation and DNA repair”, but little is known 
about its involvement in transcriptional 
regulation. Decorsiére et al. have shown that 
this complex may have a key antiviral role by 
binding to viral genomes and silencing their 
transcription. How Smc5/6 targets episo- 
mal DNA to exert this silencing effect, and 
whether it has a similar activity against other 
DNA viruses, remain to be investigated. There 
is evidence’’ that HBV transcription is tightly 
regulated by epigenetic modifications (those 
that alter gene expression without modifying 
the nucleic-acid sequence), and it seems that 
HBx epigenetically modifies the HBV mini- 
chromosome. It is possible that the Smc5/6 
complex silences transcription by affecting the 
epigenetic status of the viral mini-chromosome. 

It has been suggested that HBx is present 
on the HBV mini-chromosome”, although 
it is not known whether its association with 
the genome is necessary for the targeted 
degradation of the Smc5/6 complex. It is also 
unclear whether HBx interacts directly with 
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Smc5/6; because the complex contains many 
proteins, it could be that a yet-unidentified 
factor in Smc5/6 is the direct target of the 
CRL4-DDB1-HBx complex. 


CORAL REEFS 


Another remaining puzzle is why the gene 
that encodes HBx exists in the mammalian but 
not in avian hepatitis viruses. Did the inter- 
action of this protein with Smc5/6 emerge 
as mammalian HBV diverged from its avian 
counterparts? It is intriguing to speculate that 
the HBx gene might have been acquired from 
the host as the virus entered mammals more 
than 10,000 years ago; acquisition of host 
genes is a typical evolutionary event for many 
viruses. It seems likely that Decorsiére and 
colleagues’ discovery is not the final episode of 
the “HBx files. m 
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Turning back time 


An in situ experiment finds that reducing the acidity of the seawater surrounding 
a natural coral reef significantly increases reef calcification, suggesting that 
ocean acidification may already be slowing coral growth. SEE LETTER P.362 


JANICE M. LOUGH 


Australia’s Great Barrier Reef was localized 

damage caused by European sailing ships 
suchas HMS Endeavour, which struck the reef 
on 11 June that year. Wind the clock forward 
almost 250 years and the threats to tropical 
coral reefs worldwide have escalated to a level 
that imperils the survival of these complex, 
diverse and beautiful ecosystems’. On page 362 
of this issue, Albright et al.’ report an elegant 
field experiment in which they measured the 
response of a coral-reef community in the 
southern Great Barrier Reef to ocean chem- 
istry conditions that were characteristic of the 


EF 1770, the only external threat to 
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pre-industrial era. By turning back time in 
this way, they demonstrate that, all else being 
equal, net coral-reef calcification would have 
been around 7% higher than at present, sug- 
gesting that ocean acidification may already be 
diminishing coral-reef growth. 

Threats to coral reefs range from regional 
factors, such as overfishing and land-based 
pollution, to global-scale issues arising from 
human interference in Earth's climate sys- 
tem, causing warming and acidification of the 
oceans. Tropical coral reefs have shown their 
sensitivity and vulnerability to the relatively 
modest amount of warming that has already 
occurred. This warming has, for example, 
resulted in mass coral-bleaching episodes 


KEN CALDEIRA 


on many of the world’s reefs, most 
dramatically during the powerful 
1997-98 El Nifio event, and again 
during the equally large 2015-16 
El Nifio. Alongside this warming 
is the ongoing acidification of the 
oceans, caused by increased uptake 
of carbon dioxide from the atmos- 
phere. Acidification is projected to 
increasingly compromise the ability 
of calcifying marine organisms to 
produce the calcium carbonate that 
forms their skeletons and shells’. 

The skeletons of coral animals 
form the backbone of tropical coral 
reefs and provide habitat for many 
thousands of other organisms. Calci- 
fication rates must be fast enough 
to withstand the natural forces of 
physical and biological erosion. Coral 
animals achieve this with the aid of 
the extra food and energy provided 
by single-celled symbiotic algae called 
zooxanthellae. These algae provide 
the beautiful colours of corals, but 
under stressful environmental con- 
ditions, such as unusually warm 
water temperatures, the algae are 
lost from the coral tissue, resulting in 
bleaching. 

Corals build their skeletons by 
precipitating calcium and carbonate 
ions from seawater, but the absorp- 
tion of CO, by seawater shifts the 
equilibrium away from carbonate- 
ion availability towards bicarbonate 
ions. The world’s oceans absorb 
about 25% of the extra CO, that humans 
inject into the atmosphere — without 
this sink the planet would have warmed more 
than it has so far. But this absorption has 
decreased the pH, carbonate-ion concentra- 
tion and saturation state of carbonate minerals 
of the surface oceans, collectively referred to as 
ocean acidification’. 

Since alarm bells starting ringing about the 
probable consequences of these changes for 
tropical coral reefs’, numerous laboratory and 
field experiments have confirmed that calcifi- 
cation rates decline with ocean acidification. 
However, these studies have focused primar- 
ily on single coral species in experimental 
environments manipulated to mimic future 
conditions. The beauty and novelty of Albright 
and colleagues’ study is that they restored 
the ocean chemistry of a natural reef to that 
of pre-industrial times, thus factoring out 
other potentially confounding factors, such as 
temperature. 

The authors took advantage of the unusual 
hydrodynamic environment at One Tree Reef 
in the southern Great Barrier Reef (Fig. 1). 
Here, three lagoons are isolated from the 
ocean at low tide, but water flows between two 
of the lagoons (as a result of elevation differ- 
ences) across a well-developed reef flat. For 


. 


Figure 1 | One Tree Reef in the southern Great Barrier Reef, Australia. 
The reef includes three lagoons in which water is separated from the rest 
of the ocean at low tide each day. This interval allowed Albright et al. to 
study the effect of reducing the acidity of the water on coral growth. 


60 minutes at low tide each day for 22 days, the 
researchers pumped a non-reactive dye solu- 
tion across the reef flat, which for 15 days was 
enriched with sodium hydroxide to increase 
the alkalinity, and thus reduce the acidity, of 
the water. 

This application of dual tracers — alkalinity 
as an active tracer and the dye as a passive tracer 
— has been previously applied to atmospheric 
aerosols. The tracing allowed the authors 
to calculate how much of the added alkalin- 
ity was taken up by the reef-flat community 
(around 17%). The authors also recorded 
an increase in aragonite saturation state (a 
measure of the availability of dissolved car- 
bonate and calcium ions) of around 0.6 and 
an approximately 7% increase in net coral- 
community calcification, findings that closely 
match results from previous experimental 
manipulations of ocean chemistry’. 

One implication of these discoveries is that 
ocean acidification may already be contribut- 
ing to observed declines in coral growth’. As 
Albright et al. acknowledge, ocean chemistry 
is not the only influence on coral calcifica- 
tion. The surface temperatures of the tropical 
oceans were also significantly cooler in pre- 
industrial times®. Although current rates of 
warming are deemed detrimental, some corals 
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may have initially benefited from 
warmer oceans’. We need to consider 
not just what ocean acidification and 
warming separately mean for coral- 
reef calcification, but also their com- 
bined effects and how these factors 
interact with local stressors, as well 
as the consequences for coral-reef 
communities as a whole. Extension 
of Albright and colleagues’ approach 
to include other factors could help us 
to get to grips with the relative risks of 
a rapidly changing future for different 
coral reefs and locations. 

By turning back time in an experi- 
mental scenario, Albright et al. have 
demonstrated the vulnerability of 
the process of reef calcification to 
ocean acidification. But we cannot 
turn back time for the world’s tropi- 
cal coral-reef ecosystems; we have 
already committed them to a warmer 
and more acidic future that is likely 
to fundamentally change them from 
the diverse and beautiful ecosystems 
of the pre-industrial era to much sim- 
pler ecosystems that are not domi- 
nated by corals’. Such a loss will also 
affect many millions of people who 
depend on coral reefs for their liveli- 
hoods and food security. 

The 2015 Paris climate agreement 
to keep the average global tempera- 
ture “well below 2°C above pre- 
industrial levels” contends that this 
limit “would significantly reduce the 
risks and impacts of climate change”. 
But a 2°C limit may not be enough for tropical 
coral reefs, which are at high risk of substan- 
tial impacts from ocean warming and acidifi- 
cation, even with the most stringent emission 
targets’” m 


Janice M. Lough is at the Australian Institute 
of Marine Science and the Australian Research 
Council Centre of Excellence for Coral Reef 
Studies, James Cook University, Townsville, 
Queensland, Australia. 

e-mail: j.lough@aims.gov.au 


1. Hoegh-Guldberg, O. et a/. Science 318, 1737-1742 
(2007). 

2. Albright, R. et a/. Nature 531, 362-365 (2016). 

3. Doney, S.C., Fabry, V. J., Feely, R. A. & Kleypas, J. A. 
Annu. Rev. Mar. Sci. 1, 169-192 (2009). 

4. Jiang, L-Q. et al. Glob. Biogeochem. Cycles 29, 
1656-1673 (2015). 

5. Kleypas, J. A. et al. Science 284, 118-120 
(1999). 

6. Chan, N. C. & Connolly, S. R. Glob. Change Biol. 19, 
282-290 (2013). 

7. Lough, J. M. & Cantin, N. E. Biol. Bull. 226, 187-202 
(2014). 

8. Tierney, J. E. et al. Paleoceanography 30, 226-252 
(2015). 

9. Fabricius, K. E. et al. Nature Clim. Change 1, 
165-169 (2011). 

10.Gattuso, J.-P. et al. Science 349, 45 (2015) 


This article was published online on 24 February 2016. 


17 MARCH 2016 | VOL 531 | NATURE | 315 


© 2016 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


doi:10.1038/nature17148 


Interface dynamics and crystal phase 
switching in GaAs nanowires 


Daniel Jacobsson!, Federico Panciera*, Jerry Tersoff*, Mark C. Reuter*, Sebastian Lehmann!, Stephan Hofmann?, 


Kimberly A. Dick! & Frances M. Ross* 


Controlled formation of non-equilibrium crystal structures is one of the most important challenges in crystal growth. 
Catalytically grown nanowires are ideal systems for studying the fundamental physics of phase selection, and could lead to 
new electronic applications based on the engineering of crystal phases. Here we image gallium arsenide (GaAs) nanowires 
during growth as they switch between phases as a result of varying growth conditions. We find clear differences between 
the growth dynamics of the phases, including differences in interface morphology, step flow and catalyst geometry. We 
explain these differences, and the phase selection, using a model that relates the catalyst volume, the contact angle at 
the trijunction (the point at which solid, liquid and vapour meet) and the nucleation site of each new layer of GaAs. This 
model allows us to predict the conditions under which each phase should be observed, and use these predictions to design 
GaAs heterostructures. These results could apply to phase selection in other nanowire systems. 


Many materials can grow in multiple (meta)stable crystal struc- 
tures, and phase selection is one of the most fundamental problems 
in materials science. However, the selection process is difficult to 
access experimentally; for example, many metastable phases are 
obtained only by rapid quenching of a liquid into a polycrystalline 
multiphase solid. By contrast, nanowires provide an ideal system for 
studying phase selection. Typical zinc-blende (ZB)-structure III-V 
semiconductors form nanowires in the wurtzite (WZ) structure 
as well as the ZB!~°. Nanowires can easily be switched between 
these phases by varying the temperature, source-material flux or 
impurities*+*'*; and their small diameter guarantees that they are 
single crystals, with phase switching occurring along the growth axis 
where it is easily observed. ZB and WZ semiconductors have different 
band structures!>, which creates opportunities for designing mod- 
ulated nanowire structures with new electronic properties. Crystal 
phase heterostructures are particularly interesting because they can 
access the electronic properties of heterostructure quantum dots for 
photonics and single-electron-transistor applications, but without the 
challenge of achieving compositional control'©””. 

To take full advantage of the possibilities offered by crystal structure 
control, a detailed understanding of the physics behind crystal phase 
selection is required. On the basis of post-growth observations, dif- 
ferent models have been proposed for phase selection. These models 
emphasize the role of supersaturation, catalyst geometry and interfacial 
energies”. Experimental results are typically interpreted in terms of 
the dominant role of one of these factors® *10!72026-28, 

Here, we directly observe the dynamic processes that take place 
during nanowire growth for each crystal phase, and during the switch 
between phases, using in situ transmission electron microscopy. We 
find surprising differences in the structure and dynamics during 
growth of ZB and WZ nanowires. The switching process itself, and the 
associated changes in geometry, provide clues that allow us to develop a 
new model identifying the underlying mechanism driving crystal phase 
selection. In this model, droplet geometry is the key parameter in deter- 
mining structure, but in an indirect way, via its effect on the nanowire 
edge morphology. This understanding allows us to form crystal phase 
quantum dots with atomic layer precision. 


Imaging interface and catalyst geometry 

We observed the two GaAs nanowire crystal phases during growth 
in situ using a Hitachi H-9000 ultra-high vacuum transmission electron 
microscope (UHVTEM)**”®. Si substrates were first covered with pre- 
grown GaAs nanowires using standard metal-organic vapour phase 
epitaxy (MOVPE) and Au aerosol particles with diameters of 30 nm, 
50nm and 70 nm. Such samples were loaded into the TEM and heated 
resistively, using a pyrometer to calibrate the temperature at each heat- 
ing current (see Methods). Pure trimethylgallium (TMGa) and arsine 
(AsH3) were used as precursor gases, and were introduced close to the 
substrate using separate capillary tubes to a maximum total pressure 
during imaging of 2 x 10~° Torr. Details of how the growth parameters 
in situ compare to conventional MOVPE are provided in Methods. On 
heating to temperatures of about 550°C, a liquid AuGa droplet formed 
at the nanowire tips and growth took place at the droplet/nanowire 
interface. Growth was recorded at 30 images per second. Dark-field 
imaging conditions, as used in Fig. 1, allow the crystal structure to be 
distinguished, that is, the WZ phase and the two twin variants of the 
ZB phase (Extended Data Fig. 1). Bright-field imaging conditions, as 
used in Fig. 2, allow a more accurate determination of the dimensions 
of the droplet and the nanowire. 

We find that, in situ, both ZB and WZ GaAs can be grown by varying 
the precursor pressures (the V/III ratio) while maintaining a constant 
temperature. Within the parameter range accessible in situ, WZ GaAs 
forms at higher V/II] ratios and ZB at lower ratios at steady-state con- 
ditions; transient conditions are discussed below. The two phases show 
marked differences in terms of their growth dynamics. For WZ GaAs, 
growth proceeds by step flow across the droplet/nanowire interface 
(Fig. 1a and Supplementary Video 1). Steps flow slowly with each one 
starting as soon as the previous one has completed its flow (Extended 
Data Fig. 2a). By counting the number of step-flow events and cor- 
relating with the length of the nanowire (Extended Data Fig. 2b), as 
in ref. 31, we find that each step flow represents the addition of one 
WZ GaAs(0001) bilayer, with a height of 0.3 nm. The growth rates are 
low under the conditions accessible in situ, typically one bilayer per 
minute (see Methods and Supplementary Video 1). Growth rates are 
proportional to AsH3 pressure (Extended Data Fig. 2c), which suggests 
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Figure 1 | Interface dynamics during WZ and ZB growth of GaAs. 

a, Images extracted from a dark-field movie recorded during WZ GaAs 
growth. A step flows across the top facet of a nanowire that has a diameter 
of 60 nm; the position of the step is indicated by the arrows. See also 
Supplementary Video 1. Growth conditions: 550°C, AsH; pressure of 

1 x 10~° Torr, TMGa pressure of 3.5 x 10-8 Torr. The narrow stripes show 
previously grown ZB segments: ZB can occur in two twinned orientations, 
one appearing bright and the other dark in this imaging condition. 

b, Images extracted from a dark-field movie recorded during ZB GaAs 
growth. The truncation slowly fills then jumps back to its maximum 

size (the simultaneous rapid step flow across the growth interface is not 
visible). See also Supplementary Video 2. Growth conditions (not steady- 
state for this example): 550°C, AsH; pressure increased from 10~ Torr to 
1.4 x 10~° Torr a few seconds before the first image was recorded, TMGa 
pressure of 2.0 x 10-8 Torr. ZB phases appear bright and WZ dark in this 
imaging condition. Note that the truncation shows strongly on one side of 
the nanowire; this was typical (see text), although some (5%) nanowires 
showed synchronized oscillation on both sides, as in Supplementary 
Video 5. The relative time of each image in a and b is shown in seconds. 

c, The first ZB bilayer growing on WZ, with imaging conditions as in b. 
The AsH; pressure was reduced; the droplet is in the process of growing 
larger past the critical volume; the first layer of ZB appears followed 
immediately by a truncation that cuts into the previously grown WZ. 

d, The first WZ bilayer growing on a ZB segment. The AsH3 pressure was 
increased; the droplet is in the process of growing smaller; the truncation 
fills in and the first layer of WZ appears via step flow. The truncation does 
not appear, and slow step flow occurs, even though ZB covers the top facet. 
Scale bars are 10 nm in all images. 


that growth under these circumstances is limited by the arrival and 
incorporation of As (see Methods). We can then understand the step- 
flow dynamics through the solubility of As in AuGa, which is generally 
accepted to be low*’. Since the droplet contains no reservoir of the 
rate-limiting species (in our case, As), the arriving atoms are incor- 
porated immediately into the nanowire, leading to slow and gradual 
step flow’. 

The growth of ZB GaAs looks quite different (Fig. 1b, Supplementary 
Video 2). Growth similarly proceeds by addition of bilayers, but 
each bilayer flows across the growth interface too rapidly to observe. 
Furthermore, the droplet/nanowire interface shows an oscillat- 
ing geometry at the trijunction (at which solid, liquid and vapour 
meet) that is similar to that seen in Si, ZB GaP, Ge and Al,O; 
(refs 30, 31, 34, 35). The edge of the nanowire appears truncated 
(Fig. 1b, first panel). The three-dimensional geometry of this ‘edge 
facet’ is shown schematically in Extended Data Fig. 1. Material 
gradually adds to the edge facet to fill in the corner (Fig. 1b, second 
and third panels). The interface jumps forwards as one step flows 
quickly, and the edge facet reappears (Fig. 1b, fourth and fifth panels). 
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Figure 2 | Changes in droplet volume during phase switching. a, Series 


of bright-field images obtained during growth of a GaAs nanowire at 
varying AsH;, pressure, constant TMGa pressure (2 x 10~* Torr) and 
constant temperature (550°C). Scale bar is 10nm. b, Droplet aspect ratio 
h/d and angle ¢ as defined in a. The times of the images in a are indicated 
by coloured boxes. The droplet volume takes several minutes to respond 
to the change in pressure. The crystal phase is indicated by blue (WZ) 
and red (ZB) squares. Each red square marks the occurrence of a 
truncation of the top facet and nucleation of a ZB bilayer. Each blue 
square marks the identification of WZ growth via step flow. c, AsH3 
pressure variation over time. 


Each oscillation in trijunction geometry is correlated with the nucle- 
ation and flow of a new bilayer, as in ref. 31. The step moves quickly, 
even with no reservoir of As, because it is supplied by material from 
the truncated volume. A rough estimate of the change in the trun- 
cated volume is consistent with it being the source of the one bilayer 
of growth. 

In parallel with the observed changes in the interface dynamics, the 
droplet geometry also changes as we vary the V/III ratio to achieve 
growth of WZ and ZB GaAs. Figure 2 and Supplementary Video 3 
show the effect on the droplet of changing the AsH3 pressure between 
high values (to achieve WZ) and low values (to achieve ZB) at con- 
stant TMGa pressure (2 x 10-® Torr). Temperature was kept constant 
throughout the experiments (at 540-560 °C) to avoid introducing 
temperature dependencies that would obscure the observed trends 
(see Methods). Throughout the changes in AsH3 pressure, the nano- 
wire continued to grow. The most obvious feature is the change in 
the volume of the droplet. We quantify this in Fig. 2b via the drop- 
let aspect ratio h/d (droplet height divided by nanowire diameter at 
the growth interface) or, equivalently, via the droplet angle ¢ (angle 
between the basal plane and the tangent to the droplet at its edge; 
see Fig. 2a). On decreasing the AsH; pressure, the droplet increases 
in volume; increasing the AsH; pressure decreases the droplet vol- 
ume. A quasi-steady-state volume is reached, depending on the V/III 
ratio. These volume changes must be driven by the addition or sub- 
traction of Ga: we do not expect Au to move in and out of the droplet, 
because its diffusion on GaAs is assumed to be negligible at this tem- 
perature”, whereas As makes up only a small fraction of the volume, 
owing to its low solubility (discussed above). An Au-Ga-—As alloy 
with over 40% Ga forms a liquid at this temperature, with no upper 
limit on the Ga content*?; consequently, droplets of a range of volumes 
are possible. Furthermore, because the volume changes occur more 
quickly than the rate at which Ga could be consumed by incorpora- 
tion into the growing nanowire, the Ga must be supplied or removed 
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Figure 3 | Growth of a WZ nanowire containing multiple narrow ZB 


segments. TMGa pressure (2 x 10~* Torr) and temperature (550°C) were 
constant. a, Droplet aspect ratio h/d and angle ¢ versus time (grey line), 
and crystal phase versus time (squares). Each square indicates the addition 
of a bilayer of WZ (blue) or ZB (red), as described in Fig. 1. ZB segments 
with thicknesses of zero, one or two bilayers form each time h/d increases. 
b, AsH; pressure variation over time. The pressure was held at 1 x 10° 
Torr to grow WZ, but was pulsed downwards to less than 10-8 Torr for 
seven intervals with durations 5 min, 5 min, 7 min, 7 min, 9 min, 9 min 
and 5 min, respectively. c, d, Images of the nanowire at the start (d) 

and end (c) of the experiment; scale bar is 10 nm. Inc, the position 

of the growth front at each of the seven intervals is indicated by an arrow. 
The ZB segments grown in the 7- and 9-min intervals are visible as 
narrow stripes. Three segments have one ZB twin orientation (bright 
contrast in this imaging condition) and one has the other ZB twin 
orientation (dark contrast). 


by surface diffusion along the nanowire. In the simplest picture’’, 
there is a surface reservoir of mobile Ga adatoms that equilibrate 
with the droplet over time whenever the chemical potentials of 
Ga in the droplet (and on the surface) are changed by altering the 
AsH;3 pressure. In this way, the V/III ratio controls the droplet size. 
(A more complete treatment would include diffusion and the effect 
of As flux on Ga diffusion, but this would not change the general 
picture.) 

We have shown above that crystal structure and droplet volume 
both change as the V/III ratio varies. We now explore how they cor- 
relate with each other. During a growth experiment, it is possible to 
measure crystal structure and droplet volume by alternating between 
dark- and bright-field imaging conditions. The crystal structure iden- 
tified during the experiment in Fig. 2a is shown as the red and blue 
data points in Fig. 2b. It is clear that the switch between WZ and ZB 
crystal structure occurs as the droplet passes a certain aspect ratio— 
under these conditions, this is h/d~0.95 and ¢ 125°. (Other exper- 
iments show a small hysteresis that is not visible in this data; see, for 
example, Supplementary Fig. 3.) Because the droplet takes several 
minutes to respond to the pressure change, it is clear that WZ-ZB 
growth is correlated to h/d and ¢ rather than to the instantaneous 
AsH;3 pressure. This direct correlation between crystal switch and 
droplet dimensions (volume, aspect ratio and angle), governed ulti- 
mately by the V/III ratio, is a key result that provides the basis for the 
model we develop below. 

Understanding that the droplet geometry is the critical parameter, 
rather than the gas environment itself, provides useful guidance in 
growing crystal phase heterostructures. The length of each crystal 
phase segment depends on the time during which the droplet has 
the appropriate geometry. The relatively slow kinetics of the change 
in droplet volume mean that the V/III ratio must be designed with 
appropriate offsets in timing. An example is shown in Fig. 3. Here, 
the V/III ratio was set initially at a value that formed WZ GaAs. 
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The AsH3 pressure was then decreased, for short pulses, to a V/III ratio 
that would be expected to form ZB GaAs. The result is a series of ZB 
inclusions in a WZ nanowire with lengths that are repeatable, but not 
directly proportional to the pulse duration. The shortest pulses did 
not form ZB GaAs at all. Longer pulses produced one or two bilayers 
of ZB stacking. The correlation between droplet volume and crystal 
phase in Fig. 3 confirms that the droplet must reach a critical volume 
for the structural change to occur; the reduction of AsH3 pressure in 
itself may not trigger a structure change. Designing a crystal phase 
heterostructure thus requires consideration of the kinetics of the 
change in droplet volume. 


A model for interface geometry 

Developing a framework to understand the relationship observed above 
between droplet volume, interface growth dynamics and crystal struc- 
ture requires two additional key observations. The first observation is 
that when the droplet changes volume, it changes composition too. This 
could in principle affect phase selection, for example, by changing the 
surface energy~”. It is therefore not immediately clear which factor deter- 
mines crystal structure, the Au:Ga ratio or the geometry (h/d and 4). 
To establish this we measured h/d and ¢@ at the switch for one particular 
nanowire, then allowed the nanowire to increase its diameter by con- 
formal growth on the sidewalls, and again measured h/d and ¢ while 
inducing a switch (Extended Data Fig. 3). Because the nanowire widens, 
but the amount of Au present does not change, relatively more Ga is 
needed to achieve the same h/d and ¢. We did not see any strong effect 
of diameter on the switch between crystal phases; h/d and @ appear to 
be the controlling parameters. 

The second observation concerns the relationship between crys- 
tal phase and the dynamics at the growth front. In Fig. 1c, d and 
Supplementary Videos 2 and 4 we show the crystal switch in more 
detail, specifically the growth of the first ZB layer on WZ GaAs and 
the first WZ layer on ZB GaAs. Starting from a WZ nanowire, we 
reduce the AsH; pressure (Fig. 3c); the droplet enlarges and the first ZB 
layer forms as the critical h/d and @ are reached. As the ZB step flows 
(too rapidly to see) across the growth interface, an edge facet appears 
(Supplementary Video 4). This facet cuts into the WZ beneath. Thus, 
even though steady-state growth of WZ GaAs proceeds without an edge 
facet, it is possible to form an edge facet in WZ GaAs under appropriate 
conditions. Conversely, if we start from a ZB nanowire and increase the 
AsH3 pressure, as shown in Fig. 1d, then the droplet shrinks and the 
first WZ layer grows. It grows by slow step flow and without an edge 
facet appearing (Supplementary Video 2), even though ZB is exposed 
on the growth interface as it starts to grow. 

These morphology observations are particularly surprising. One 
might expect the presence of an edge facet to be controlled by the 
crystal structure at that facet; but, instead, Fig. 1c, d shows that it 
correlates with the crystal structure on the main (that is, ZB(111) 
or WZ(0001)) growth facet, and thus with h/d and ¢. To determine 
cause and effect, we analyse the ways in which the droplet angle ¢ 
affects the morphology of the growth interface. Equilibrium crystal 
shapes generally do not have edge angles as sharp as 90°, so one would 
expect an edge facet in a GaAs crystal. A sharp edge only exists during 
nanowire growth because the droplet is present, providing a capillary 
force that can pull on the edge facet and shrink its size to zero. We 
calculate the circumstances under which this occurs. We assume an 
ideal, symmetric nanowire for which the droplet angle ¢ is the same 
all around the edge (see Fig. 4a); the more realistic, asymmetric case 
is discussed below. The difference in free energy between this ideal 
nanowire with an edge facet and a nanowire with the same geometry 
but a sharp edge is?! 


1 
AE=cyL + —y"(cotO)(Hexe— Mo)L + cay"E (1) 


in which L is the total length of the edge, y is the facet length and 
0 is the facet angle (see Fig. 4a), {Ucar — Uo reflects the supersaturation 
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Figure 4 | Model relating droplet size to interface morphology. 

a, Schematic of a quasi-two-dimensional, ideal, symmetric nanowire 
illustrating the droplet angle ¢, and the edge facet angle 0 and length 

(in the growth direction) y. ‘N’ marks the interior point at which ZB 
nucleates; “T’ marks the trijunction. b, Edge facet length y versus droplet 
angle ¢, calculated for two values of supersaturation (blue, low; green, 
high) for the symmetric nanowire. There is a range of ¢ in which y= 0; in 
this range c; > 0 and the edge facet does not exist. Supersaturation does not 
affect this range. c-e, Schematics of the nanowire and droplet for ¢ < 90° 
(c), ¢= 90° (d) and ¢ > 90° (e). The possibility of the droplet depinning 
from the point T is not included in the model. 


Facet length, y (a.u.) 


of chemical potential j: in the catalyst, and c. includes various other 
second-order terms*". For a sufficiently small facet length y, this 
energy is dominated by the linear term cj, which reflects the capil- 
lary forces acting on the corner facet. Therefore, for c; < 0, it is always 
energetically favourable to have the edge facet (AE < 0), whereas for 
c; > 0, we expect the edge to be sharp everywhere and have no facet. 
By examining c, in more detail and including all the capillary terms”’, 
we find 


cos0 ; 
Vis VW sing (2) 
sind 


oe leg Me 
in which 7, is the liquid-solid interfacial energy at the edge facet, 7; 
is the vapour-solid interfacial energy on the sidewall, 7, is the liquid- 
solid interfacial energy at the main growth facet and 7 is the vapour- 
liquid interfacial energy. 

The key point here is the presence of @ in equation (2). This implies 
that the droplet angle alters c; and, hence, changes the lowest-energy 
state of the nanowire from one with an edge facet to one with a sharp 
corner. A hemispherical droplet (¢=90°) yields the maximum possi- 
ble value of c), and so is the most favourable for eliminating the edge 
facet. This is shown in Fig. 4b, in which we calculate the length y of 
the edge facet as a function of angle ¢ for a symmetric, but otherwise 
arbitrary, illustrative case. In our experiments, the droplet is never less 
than a hemisphere (¢ is always greater than 90°) during stable growth. 
Thus, the analysis in equations (1) and (2) predicts that a switch could 
be observed between a large droplet with an edge facet (Fig. 4e) and 
a smaller droplet with a sharp edge (Fig. 4d); that is indeed what we 
observe. 
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Connecting geometry to crystal phase 

We now consider the ways in which the presence or absence of an 
edge facet controls the crystal phase. Without attempting to develop a 
microscopic model, we can understand heuristically how this would 
occur by considering a previous analysis”’. Several models have argued 
that the crystal structure should be determined by the location of the 
nucleation event on the main growth facet®*10!3?0-237638 The argu- 
ment is that the metastable WZ phase can only grow if it has a lower 
nucleation barrier than does the ZB phase. With sharp edges, nucle- 
ation is expected to occur at the trijunction (rather than in the mid- 
dle of the facet), so the solid-vapour interface plays a critical part. In 
particular, the solid—-vapour interface energy is thought to be lower for 
WZ nanowires, reducing the nucleation barrier for WZ relative to ZB 
in this geometry*”. However, when edge facets are present, nucleation 
on the main facet occurs away from the trijunction (presumably at 
position ‘N’ in Fig. 4a; ref. 31) and the liquid-vapour interface plays no 
part. If we adopt this argument, then it is no surprise that the change 
in the trijunction geometry can result in easier nucleation of WZ than 
ZB in one case, but not the other. This argument is also qualitatively 
consistent with in situ X-ray diffraction studies that infer (indirectly) 
that crystal structure in GaAs nanowires is determined by the geom- 
etry of the liquid—vapour interface***°. Here, we have not considered 
effects of interlayer interactions on the nucleation barrier, which might 
lead to formation of higher-order crystal phases, such as 4H, under 
certain conditions*', because we do not observe such phases in our 
experiments. 

At very small h/d, our analysis also suggests the possibility of an 
edge facet and, hence, ZB growth (Fig. 4b). Although we cannot access 
such conditions in our experiments, they could occur transiently at the 
beginning of nanowire growth, because the droplet has much a smaller 
h/d ratio when sitting on a flat surface’”, and perhaps at the end of 
growth if material in the droplet is consumed. Indeed, the ZB phase has 
been observed at the bases and tips of WZ nanowires”°, although under 
growth conditions that are different enough that our model might not 
be applicable. Recent experiments have demonstrated two transitions 
(from ZB to WZ and then back to ZB) as the V/III ratio is increased’, 
consistent with the model in Fig. 4. However, because our experiments 
have only a limited range of V/III ratios, we cannot observe the second 
transition back to ZB and so cannot assess whether the transition is 
associated with interface structure in a way that is analogous to the 
switch at lower group V pressures. 

The discussion above is simplified in several respects. No difference 
in interfacial or surface energies between ZB and WZ structures is 
included. The small, but real, differences could lead to hysteresis in 
the switching angle as the droplet grows and shrinks. However, data 
such as that in Fig. 2, which displays no strong hysteresis, suggest that 
the droplet angle has a larger effect than do the differences between ZB 
and WZ interfacial energies. 

More importantly, our quasi-two-dimensional model treats the 
droplet angle ¢ as uniform all the way around the edge. For the true 
three-dimensional geometry, in which the droplet sits on a hexa- 
gonal prism whose side lengths may not be equal", it is clear that 
@ will vary around the trijunction. Suppose that for a WZ nanowire 
we change the conditions to enlarge the droplet. At some point, @ 
will become large enough along one edge for that edge to become 
truncated, even though other edges remain sharp. As the droplet 
continues to grow, every other edge will progressively become trun- 
cated. Therefore, we expect any nanowire with unequal edge lengths 
to exhibit a mix of sharp and truncated edges over a range of droplet 
volumes. In the experiments, we observe the ZB phase once the first 
truncated corner appears (Fig. 1 and Supplementary Videos 2 and 4). 
Because we typically stabilize the conditions as soon as we see the 
crystal switch, the majority of nanowires presented here show a mix 
of sharp and truncated edges. However, we also observe symmet- 
rically oscillating nanowires (Supplementary Video 5), presumably 
because the wire is more symmetric or because the droplet has grown 
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large enough to cause all edges to be truncated. The observation 
mentioned above that the ZB phase grows if any edge is truncated, 
whereas WZ grows only when all edges are sharp, implies that nucle- 
ation of ZB at a truncated edge is actually easier than nucleation of 
either WZ or ZB at a sharp one. This somewhat unexpected result 
could provide guidance in refining model parameters in nucleation 
calculations. 


Conclusions 

Direct observation during growth has enabled us to probe the phenom- 
ena controlling crystal phase in nanowires. WZ and ZB crystal phases 
in GaAs appear markedly different during growth in terms of the mor- 
phology of the nanowire/droplet interface, the flow of steps and the 
droplet size. The step-flow kinetics can be understood as a consequence 
of As-limited growth, low As solubility in the droplet, and the role of 
the edge truncation as an alternative reservoir. Examining the switch 
between phases suggests a scenario in which the growth conditions 
(here, the V/II ratio) determine the volume of the droplet and, hence, 
its aspect ratio h/d and angle ¢; the value of @ determines whether an 
edge facet will be present; the presence or absence of the edge facet 
determines the nucleation site for a new layer; and the nucleation site 
determines which phase, WZ or ZB, is most likely to nucleate. Our 
interpretation differs markedly from previous models of phase selec- 
tion. Because nanowire growth has been achieved using a wide range 
of parameters and growth techniques, it is possible that phase selec- 
tion is controlled by different physics under different circumstances. 
However, the regime we analyse here, MOVPE under As-limited con- 
ditions, has advantages for atomic-level control and high-throughput 
manufacturing. 

This understanding of the causal sequence, in particular the changes 
in droplet volume with conditions and the controlling role of the drop- 
let angle @, has practical consequences. First, the large changes in 
droplet volume as a function of conditions may be relevant to aspects 
of nanowire growth other than crystal phase control!?. For example, 
kinking can be caused by depinning of droplets from the nanowire 
tip’”“4, and experiments such as those shown here can explore the 
range of conditions under which droplets attain sizes that are suf- 
ficiently large or sufficiently small to cause depinning. In terms of 
crystal phase control, because the Au:Ga ratio in the droplet seems not 
to be critical, our results might be applicable to self-catalysed (Au-free) 
nanowire growth (although any small difference in liquid surface ener- 
gies could lead to a slightly different critical angle). A second conse- 
quence is that any means of controlling the energy balance between a 
truncated and sharp edge should affect the crystal phase: we used the 
V/III ratio here, but temperature and surfactants are other possible 
ways to tune the crystal phase. We anticipate that similar behaviour 
may occur in other III-V semiconductors that exhibit polytypism, 
although it is not guaranteed because the various interfacial-energy 
parameters are material-specific. Finally, understanding how crystal 
structure switching depends on the kinetics of group III motion into 
and out of the droplet helps us work towards precise control of indi- 
vidual crystal phase superlattices, to enable fabrication of new types 
of electronic devices that make full use of the possibilities for engi- 
neering band structure that are provided by crystal-phase-engineered 
nanowires. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
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METHODS 


The GaAs nanowires imaged in this study were grown on Si(111). The substrates 
were cut from a Si(111) wafer into strips 3mm x 350\1m 500|.m, small enough 
to fit directly into the TEM heating holder. The strips were, however, too small to 
be handled in the GaAs growth system, so they were stacked in arrays, parallel to 
each other with the polished surface facing upwards, and mounted on a larger Si 
wafer. At Lund University, Au aerosol particles with diameters of 30 nm, 50nm 
and 70 nm were deposited onto the arrays of strips using a size-selected aerosol 
source at a total density of about 1 particle perjim”. Then, GaAs nanowires of the 
order of 500 nm in length were grown on the arrays using standard metal-organic 
vapour phase epitaxy in an Epiquip system, operating at 100 mbar with AsH; and 
TMGa as precursor gases and H) as carrier gas. After growth, the arrays were 
glued to sample boxes using a small piece of SEM-type double-sided carbon tape, 
the sample boxes were placed in a plastic bag, which was vacuum sealed, and 
then the bag was sent through air to the UHVTEM at IBM. The individual strips 
were separated and each sample was degassed in UHV by resistive heating below 
100°C for 30 min, flowing a direct current through the Si strip. The heating current 
required for a temperature of around 300°C was then determined in a separate 
UHV chamber using an infrared pyrometer. All of the strips had a similar 
temperature-current calibration, so it was possible to estimate the current required 
to heat the sample to 500°C or 550°C. The sample was transferred to the UHVTEM 
column to check that the nanowires and Au catalysts were still present after this 
process. Finally, TMGa was flowed to a chosen pressure of around 5 x 108 Torr 
as measured using a mass spectrometer, AsH; was flowed to a chosen pressure of 
around 2 x 10~> Torr as measured on the column ion gauge, the nanowires were 
heated to 500-550 °C and GaAs was grown at the nanowire tips. The crystal phase 
was generally controlled using AsH; pressure, which was easier to measure and 
faster to change than TMGa. After experiments on one sample were completed, 
the full current-temperature calibration curve was obtained for that sample. The 
reason for this calibration procedure was to prevent any damage (for example, 
etching) of the wires by overheating before the growth experiment began. Owing 
to drift of the temperature on continued heating, we estimate the temperature 
accuracy to be +20°C. All observations of crystal switching occurred between 
500°C and 570°C. Approaching 500°C, the temperature range over which switch- 
ing occurred became narrower and ZB grew for all accessible pressures. Above 
600°C, the nanowires etched slowly at the Au/GaAs interface, presumably owing 
to the low group V pressure. 

This growth in situ within the TEM is somewhat different from standard 
MOVPE. Even though the conventional MOVPE precursor gases are used, there 
is no H; carrier gas during growth within the TEM, as is typically used during 
MOVPE growth of GaAs. In addition, the absolute pressures of the two precursor 
species are lower than typical precursor partial pressures used in MOVPE. The 
lower partial pressures can alone account for the low growth rates observed here 
compared to those observed in MOVPE. To compare the effects of the V/III 
ratio, we need to consider the possible differences between the two methods in 
more detail. 

The growth in this study was observed to always be group-V limited, as seen in 
Extended Data Fig. 2c. This is in contrast to standard MOVPE, where high group-V 
flows and group-III-limited regimes are typically used. Instead, one could argue 
that the in situ TEM conditions are more similar or relevant to chemical beam 
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epitaxy (CBE), or possibly molecular beam epitaxy (MBE), than to MOVPE. The 
group-V-limited regime is also highly relevant to catalyst-free growth from Ga 
droplets. One exception for MOVPE is a recent study” exploring group-V-limited 
regimes in a standard MOVPE reactor to grow WZ and ZB GaAs. In this work, 
it was shown that at low enough group-V flow to yield group-V-limited growth, 
WZ grows at ‘high’ V/III ratio, whereas ZB grows when the V/III ratio is lowered. 
This result is in contrast to the more well-known behaviour in the group-III- 
limited regime, in which ZB forms at high V/III ratio; however, the result of ref. 9 
is entirely in agreement with the results of this study. Where the present work and 
that presented in ref. 9 differ is in the absolute magnitude of the V/III ratio: here, 
V/III ratios of 100 or more yielded group-V-limited nanowire growth; in ref. 9, the 
group-V-limited regime occurred at V/III ratios of less than 2. 

This comparison suggests that the effective As pressure at the growth front is 
substantially lower in the TEM, relative to the Ga pressure, than in typical MOVPE. 
To understand this, we note that AsH; pyrolysis in GaAs growth is generally con- 
sidered to proceed heterogeneously on GaAs surfaces, without interaction with the 
carrier gas*’. This pyrolysis starts with adsorption of AsH; onto the surface, fol- 
lowed by sequential dissociation of H atoms one by one, eventually leaving atomic 
As adsorbed on the surface*®. When AsH; is combined with TMGa, however, 
the two species decompose together, simultaneously, via adduct formation on the 
surface; this decomposition pathway does not involve hydrogen and is more effi- 
cient than the decomposition of either species alone“’. That the species decompose 
primarily on the surface is an important clue to the relatively inefficient supply of 
As in the UHVTEM. First, the nanowires are grown on Si substrates rather than on 
GaAs; although there is also ample GaAs surface on the pre-grown nanowire stubs, 
this surface is clearly different from the typical GaAs substrates used for MOVPE 
nanowire growth. Second, the surfaces are likely to be passivated with hydrogen 
when growth occurs in a H2 atmosphere; the absence of H here could affect the 
supply in a number of ways, changing, for example, the decomposition process 
and precursor surface diffusion. Finally, the decomposition process relies on the 
desorption of gas-phase As species. This adsorption process naturally depends on 
the partial pressure; because As has a substantially higher vapour pressure than 
Ga, the adsorption process of As will be reduced to a greater extent by the lower 
partial pressure. 

Other minor differences between growth in the TEM and growth ina reactor are 
expected, owing to the experimental set-up. When using needle valves rather than 
standard mass-flow controllers to control the precursor flows, the experimental 
parameters are less accurately controlled than they are in dedicated epitaxy growth 
systems. At the high V/III ratios used (V/III > 100), the TMGa partial pressure 
is much lower than that of AsH3. A gauge reading the total pressure close to the 
sample is used to monitor the AsH; flow and provide fast feedback on the AsH3 
pressure at the sample. To monitor TMGa pressure, a mass spectrometer is used, 
with a controlled, steady pressure of TMGa set at the start of the experiment and 
generally held constant. The mass spectrometer is continuously used during the 
experiments to monitor any drift in the TMGa pressure. 


45. Larsen, C. A., Buchan, N. |. & Stringfellow, G. B. Reaction mechanisms in the 
organometallic vapor phase epitaxial growth of GaAs. Appl. Phys. Lett. 52, 
480-482 (1988). 

46. Tamaru, K. The decomposition of arsine. J. Phys. Chem. 59, 777-780 (1955). 
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<110> (Beam direction) 


Extended Data Figure 1 | Distinguishing crystal phases using dark-field _c, Image recorded during growth, also in the (110) direction, showing 
imaging. a, Dark-field images recorded using three spots in the diffraction the dark-field contrast and the measured angle of the truncated corner. 


pattern showing how WZ and the two variants of ZB (ZB1 and ZB2) are d, A schematic showing the hexagonal cross-section as seen from the side 
distinguished in the (110) direction. The right panel shows a bright-field (beam direction) and from above. Seen from the side, the droplet contact 
(BF) image for comparison. b, Diffraction pattern with the three spots angle ¢ and corner-facet angle 0 are shown. Seen from above, the electron 
indicated. These are post-growth images recorded in a JEOL 3000 TEM. beam direction is shown. 
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Extended Data Figure 2 | Kinetics of WZ growth. a, Step-flow kinetics the standard deviation of each subset (length measured in 3-4 different 
measured for the WZ nanowire shown in Supplementary Video 1 and movie frames for each data point). c, Growth rate versus AsH; pressure, 
Fig. la. b, Length of a WZ nanowire versus the number of step-flow events. estimated by measuring the increase in the length of the nanowire shown 
The average gradient of the graph confirms that each step is 0.33 nm in in Fig. 2 during growth intervals at different AsH; pressures, and by fitting 
height, that is, a WZ(0001) bilayer. The red line shows the calculated a linear function to the length versus time plot. Error bars are defined 
length, assuming that each new bilayer (red squares) adds 0.33 nm in by the standard error of the fits. The growth temperature was 550°C and 
length. The black data are measured lengths, with error bars defined by TMGa pressure was constant at 3.5 x 10-8 Torr. 
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Extended Data Figure 3 | Droplet angle at the transition does not 
change with Au composition in the droplet. ac, During growth, the 
crystal structure of a nanowire was switched back and forth several times by 
changing the AsH; pressure. Data shows the measured angles and absolute 
droplet volumes at which the switch from WZ to ZB (red) and ZB to WZ 
(blue) occurred (a, b). During this time the wire also grew radially (c). 

a, Switch angles versus nanowire diameter. Shading indicates the range of 
observed angles at which WZ switches to ZB (red) and ZB to WZ (blue). 
Some hysteresis in switching is visible, perhaps because the droplet angle 
continues to change in the time before the switched layer grows. The data 
are scattered, but there is no strong dependence of angle on diameter 
(especially for the blue data points). For example, ZB switches to WZ at 


ne 


ratio, 


Droplet as 
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angles between 123° and 132° at both large and small diameter, and WZ 
switches to ZB at angles between 127° and 136°. b, Switch angle versus 
inferred composition, as calculated from the measured h/d ratio assuming 
that the amount of Au does not change. The data are scattered, but there 
is no strong dependence of angle on composition. For example, for the 
first few data points, for which the nanowire had a diameter of 30 mn, 

ZB switched to WZ for a Ga fraction of less than 60%. For subsequent 
data points, for which the nanowire had a diameter of 40 nm, the switch 
did not occur until the composition had a Ga fraction of about 75%. The 
fact that droplet angles are similar despite the change in diameter suggests 
that droplet geometry controls the switch, whereas droplet volume and 
composition do not appear to be important. 
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Lens regeneration using endogenous 
stem cells with gain of visual function 
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The repair and regeneration of tissues using endogenous stem cells represents an ultimate goal in regenerative medicine. 
To our knowledge, human lens regeneration has not yet been demonstrated. Currently, the only treatment for cataracts, 
the leading cause of blindness worldwide, is to extract the cataractous lens and implant an artificial intraocular lens. 
However, this procedure poses notable risks of complications. Here we isolate lens epithelial stem/progenitor cells (LECs) 
in mammals and show that Pax6 and Bmil are required for LEC renewal. We design a surgical method of cataract removal 
that preserves endogenous LECs and achieves functional lens regeneration in rabbits and macaques, as well as in human 
infants with cataracts. Our method differs conceptually from current practice, as it preserves endogenous LECs and 
their natural environment maximally, and regenerates lenses with visual function. Our approach demonstrates a novel 
treatment strategy for cataracts and provides a new paradigm for tissue regeneration using endogenous stem cells. 


Stem-cell therapy holds great promise in regenerative medicine. Much 
attention has been focused on pluripotent stem cells and the use of their 
derivatives for therapeutic purposes. However, several uncertainties, 
including tumorigenicity and immune rejection, have hindered their 
clinical application. An attractive alternative is to harness the potential 
of endogenous stem/progenitor cells for direct use in repair and regen- 
eration. In the case of the ocular lens, regeneration has been reported in 
lower vertebrates!. In mammals, such as rabbits, removal of the orig- 
inal lens content results in the proliferation of residual lens epithelial 
stem/progenitor cells (LECs) and the generation of a limited amount of 
lens fibres**. In humans, varying degrees of disorganized regrowth of 
doughnut-like lens tissues have been observed after congenital cataract 
removal in infants (Extended Data Fig. 1a, b). However, the underlying 
mechanism for these observations remains elusive, and the successful 
regeneration of a complete mammalian lens with biological function 
has yet to be achieved. 

Cataracts are the leading cause of blindness in the world. The 
visual axis, defined as the normal passage of light into the eye, may 
undergo visual axis opacification (VAO) owing to the cataractous lens 
or the postoperative disorganized growth of remaining LECs, leading 
to vision loss®. The current standard of care in congenital cataracts 
involves surgical removal of the cataractous lens with a large central 
capsulorhexis opening and implantation of an artificial intraocular 
lens (IOL) to replace the missing refractive media. Although artificial 
IOLs are widely used in paediatric cataract surgery, they are limited 
by complications”*, and most paediatric patients continue to require 
some form of refractive correction such as eyeglasses after cataract 
surgery”. Furthermore, IOLs are controversial in patients younger 
than two years as they have not been shown to prevent strabismus or 


amblyopia, and normal lens refractive power is not yet fully developed 
at this age!) 

To test the feasibility of in situ lens regeneration, we performed 
in vitro studies on PAX6*/SOX2* LECs and identified BMI-1 as an 
essential factor for maintaining a LEC pool in mammalian eyes by 
conditional knockout experiments. We also investigated the ability of 
LECs to differentiate into lens fibre cells in vitro. We then performed 
in vivo animal studies by establishing a new minimally invasive capsu- 
lorhexis surgery method that differs conceptually from current prac- 
tice in extracting the cataractous lens through a small wound opening, 
while preserving lens capsule integrity and therefore LECs as well. 
Using this method, we investigated lens regeneration in rabbits and 
macaques and conducted a clinical trial in human infants. Functional 
lens regeneration was observed not only in rabbits and macaques, but 
also in human patients with congenital cataracts. Therefore, our new 
study provides a novel approach to lens regeneration using endogenous 
stem cells, and results in improved outcomes. 


Essential role of LECs in lens regeneration 

In the mature lens, LECs cover the anterior surface of the lens and begin 
to differentiate into lens fibres at the equator (Extended Data Fig. 2a). 
Sustained self-renewal and protective capacities against external injury 
and oxidative damage are among the most significant functions of 
LECs"”. To assess the regenerative ability of LECs, we used bromode- 
oxyuridine (BrdU) labelling to identify proliferating LECs from human 
donor lenses. We quantified BrdU* LECs in 8-month-old, 30-year-old, 
and 40-year-old donors and found that the number of proliferating cells 
decreased with age (Extended Data Fig. 2b, c). However, upon surgi- 
cal removal of the entire lens contents with preservation of the empty 
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Figure 1 | Lineage tracing of Pax6* LECs in mice. a, Pax6-directed GFP 
was expressed in mouse LEC nuclei at post-natal days P1, P14 and P30; 
a sagittal section of a PO-3.9-GFPcre mouse lens is shown. Blue and green 
represent DAPI and GFP, respectively. b, Lineage tracing of Pax6+ LECs in 
ROSA™!", PQ-3.9-GFPcre mice at P1, P14 and P30 reveals that lens fibre 


capsular bag scaffold, the number of BrdU* cells increased by 11-fold 
(P <0.05, Extended Data Fig. 2d, e), suggesting a strong regenerative 
capacity of human LECs after injury. 

Paxé plays a central role in eye development as well as in lens 
induction. After birth, Pax6 maintains a high level of expression in 
the lens epithelium, particularly at the germinative zone (Fig. 1a). 
To determine whether Pax6* LECs can contribute to lens fibre cell 
formation, we performed lineage-tracing experiments in mice by 
crossing a Pax6 lens ectoderm enhancer-driven Cre deleter mouse 
strain (P0-3.9-GFPcre) with the ROSA” membrane-bound GEP 
reporter strain. We observed intense membrane GFP* cells throughout 
the entire lens of ROSA”™!"°; Pax6P0-3.9-GFPcre mice at P1, P14, and 
P30; in contrast, the PO-3.9-GFPCre allele alone yielded only nuclear 
GFP expression in LECs detectable by anti-GFP antibody staining 
(Fig. 1a, b). These results indicate that Pax6* LECs from the embry- 
onic or adult lens can contribute to the post-natal replacement of 
mouse lens fibre cells. 

We next isolated and expanded rabbit LECs from neonatal lens 
capsules (see Methods). These LECs showed a cobble-stone-like 
epithelial morphology with highly positive staining for LECs mark- 
ers Pax6 and Sox2, and could be passaged over time (Fig. 2a). Upon 
differentiation, these LECs formed transparent three-dimensional 
convex lens-like structures, defined as lentoid bodies (Fig. 2b, c), 
which possess significant refractive power (Fig. 2c). Immunostaining 
and western blot analysis showed that lentoid bodies expressed mature 
lens-fibre-specific genes, including those encoding aA-, 8-, and 
-crystallins (Fig. 2b, c). 

Loss of LEC homeostasis leads to cataracts 

To further investigate the LEC pool and its role in the maintenance of 
lens function, we studied BMI-1, a member of the Polycomb-group 
family. BMI-1 is known to promote the maintenance and self-renewal 
of stem cells in multiple post-natal tissues!“ and is expressed in both 
the murine lens germinative zone and in cultured human fetal LECs 
(Extended Data Fig. 2f, g and 3a). Knockdown of BMI in human LECs 
led to significantly decreased LECs proliferation in vitro (Extended 
Data Fig. 4a), without affecting expression of key genes in LECs or lens 
fibre cells (Extended Data Fig. 4b). To directly test the effects of con- 
ditional deletion of Bmil on LEC proliferation, we administered BrdU 
to 2-, 7-,and 12-month-old Nestin-cre;Bmil" mice and Bmil™" litter- 
mate controls. After a 20-h pulse, there was no significant difference in 
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b Rosa™™G- PAX6-GFPcre c 


Rosa™lmG 


cells express membrane GFP fluorescence; hence, PAX6* LECs were able 
to generate lens fibre cells. c, As an additional control, the ROSA™" allele 
alone exhibits Tomato (red) staining at sites of non-recombination. All 
scale bars, 100,1m. 


the percentage of BrdU* LECs in 2-month-old Nestin-cre;Bmil/" mice 
and Bmil' controls. However, there was a significant reduction in the 
percentage of BrdU* LECs in 7- and 12-month-old Nestin-cre;Bmil’ 
eyes compared to controls (Extended Data Fig. 3b, P< 0.05). 

We next investigated the mRNA expression levels of Bmil, Sox2 
and Ki67 in Pax6* LECsat the anterior capsule in Pax6P0-3.9-GFPcre 
mouse lens. Compared with Pax6~ (GFP-negative) LECs, Pax6* (GFP- 
positive) LECs located at the germinative zone had higher expression 
levels of Bmil, Sox2 and Ki67 (Extended Data Fig. 5). Moreover, 
conditional deletion of Bmi1 led to a dramatic decrease in the number 
of Pax6*/Sox2* LECs in ageing Nestin-cre;Bmil™" mice (Extended Data 
Fig. 3a, P< 0.001). Additionally, the lenses of ageing Nestin-cre;Bmil™" 
mice became progressively opaque, suggesting cataract formation. 


Li 


Relative expression level [i 
o o 


o wo 
2 


aA-cry. y-cry. B-cry. 


Figure 2 | Characterization and differentiation of rabbit LECs. a, LECs 
were positive for PAX6 (green) and SOX2 (red). b, Lentoid formation 
(green arrows) with positive aA-crystallin and 6-crystallin staining on 
day 15 of LEC differentiation. c, Left panel, phase-contrast photograph 
of a lentoid body on day 30; middle panel, a lentoid body demonstrating 
magnifying properties; right panels, photograph from western blot analysis 
and quantification showing a dramatic increase in expression relative to 
pre-differentiation expression of mature lens fibre markers wA-crystallin 
(2.6, 3.1, 2.2), B-crystallin (11.51, 9.0, 10.2) and y-crystallin (2.2, 2.0, 2.8) 
(numbers in parentheses represent fold change after differentiation). 

n= 3 biological replicates. All scale bars, 100 |1m. 
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Figure 3 | Lens regeneration in rabbits. a, New minimally invasive 
surgical method. The capsulorhexis size was decreased to 1.0-1.5 mm in 
diameter, resulting in a reduced wound area of 1.2 mm/?, and moved to 
the periphery of the lens. b, Slit-lamp microscopy showing the progress 
of lens regeneration after minimally invasive surgery in a rabbit eye. 

c, Fundus examination of rabbit eyes 7 weeks post-surgery demonstrated 
a clearly visible retina. Normal healthy lens shown for comparison. 

d, Measurements of refractive dioptres in rabbit eyes at different time 
points post-surgery (M, month; D, dioptres). Refractive dioptres of the 
eyes increased with time after surgery, demonstrating the functionality 
of the regenerated lenses (ANOVA, *P < 0.01). The refractive power 
immediately after surgery was defined as zero, 1 month = 0.0 dioptre, 

3 months = 11.0 + 0.8 dioptres and 5 months = 15.8 + 2.2 dioptres, n=6 
at each time point, data shown as means + s.d. e, f, Ki67 staining in the 
germinative zone of normal rabbit lens (e) and regenerated rabbit lens 7 
weeks post-surgery (f). Lower panels show higher magnification. g, PAX6 
(red) and BrdU (green) staining at the germinative zone of regenerated 
rabbit lens 7 weeks post-surgery. Scale bars, 100 jim. 


To test this hypothesis, we administered tropicamide drops to the eyes 
of 2-, 7-, and 12-month-old Nestin-cre;Bmil“! mice and Bmil lit- 
termate controls to dilate the pupils (Extended Data Fig. 3c, d). Eyes 
of 2-month-old Nestin-cre;Bmil“" mice (n=3) were indistinguish- 
able from those of age-matched controls (n = 4). However, 100% of 
the 7-month-old (n=5) and 12-month-old (n= 7) Nestin-cre;Bmilf!! 
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mice had bilateral cataracts, while none of the age-matched Bmi ws 
controls (1 =3, 7-month-old; n =5, 12-month-old) developed cata- 
racts. Moreover, haematoxylin and eosin-stained sections revealed the 
presence of cataracts in the 7- and 12-month-old Nestin-cre;Bmi pup 
mice (Extended Data Fig. 3d). These data suggest that Bmi1 loss of 
function disrupted LEC proliferation, thereby depleting the LEC pool 
and promoting cataract formation. 


LEC preservation and lens regeneration 

The current capsulorhexis method performed in paediatric cataract 
surgery involves making a large 6-mm-diameter opening at the centre 
of the anterior capsule, resulting in a large wound area and destruction 
of large numbers of LECs (Extended Data Fig. 1c). To overcome these 
limitations and to facilitate lens regeneration, we established a new 
capsulorhexis method. This new method has two advantages: (1) it 
reduces the size of the wound considerably; and (2) it moves the cap- 
sulorhexis opening from the central visual axis to the periphery. Thus, 
application of this procedure led to improved visual axis transparency 
and preservation of LECs with regenerative potential (Fig. 3a). 

We next investigated lens regeneration in rabbit eyes in vivo. We used 
our new minimally invasive capsulorhexis technique to preserve endog- 
enous LECs while removing the native lens (Extended Data Fig. 6). 
One day after surgery, slit-lamp microscopy showed that the anterior and 
posterior capsules were adherent (Fig. 3b). Four to five weeks after sur- 
gery, the regenerating lens tissue grew from the periphery of the capsu- 
lar bag towards the centre in a curvilinear symmetrical pattern (Fig. 3b). 
Seven weeks after surgery, the regenerating lens tissue formed a trans- 
parent biconvex lens along the anterior—posterior axis with a clear view 
of the posterior segment and retina (Fig. 3b, c), comparable to a normal 
healthy lens (Fig. 3c). The refractive power of the regenerated lenses 
after surgery was evaluated and found to have increased to an average 
of 15.6 dioptres from the first to the fifth month after surgery, a value 
comparable to that of a normal lens!© (Fig. 3d, P<0.01). 

In contrast to the epithelial cells and premature lens fibre cells located 
at the lens equator, the LECs in the germinative zone of regenerated 
lenses showed intense proliferative activity 7 weeks post-surgery, as 
evidenced by both Ki67 and BrdU labelling (Fig. 3e-g). Notably, some 
PAX6* LECs co-labelled with BrdU, demonstrating their proliferative 
potential (Fig. 3g). These LECs lost PAX6 expression concomitant with 
the initiation of differentiation and subsequent migration from the lens 
equator. 

One day post-surgery, histological examination revealed that a 
monolayer of LECs remained intact (Extended Data Fig. 7a). Four 
days post-surgery, LECs migrated onto the posterior capsule from the 
periphery towards the centre in a curvilinear 360° fashion with a single 
layer of epithelium on the posterior capsule (Extended Data Fig. 7a). 
Seven days post-surgery, LECs on the posterior capsule began to elon- 
gate, and their nuclei were positioned anteriorly away from the poste- 
rior capsule (Extended Data Fig. 7a). Twenty-eight days post-surgery, 
a structure with lens fibres and an extruded nucleus was observed 
(Extended Data Fig. 7b). At week 7 after surgery, the regenerated lens 
fibres elongated along the anterior—posterior axis and grew to cover 
the entire posterior capsular area, forming a lens with a double-convex 
shape (Extended Data Fig. 7c). 

We next investigated lens regeneration in macaques 1-3 months of 
age (approximately equivalent to human infants 4-12 months old), 
using a similar minimally invasive surgical technique. From post- 
operative days 1 to 3, no signs of inflammation or other undesired 
side-effects were seen. Two to three months post-surgery, regenerating 
lens tissue had grown from the periphery towards the centre in a 
curvilinear pattern (Fig. 4a). Five months post-surgery, a biconvex lens 
with a transparent visual axis had formed (Fig. 4a, b). Fundus exami- 
nation 7 weeks after surgery showed a clear view of the retina, compa- 
rable to the view of the retina seen through a normal healthy lens. No 
undesired complications, such as macular oedema, retinal detachment, 
or endophthalmitis were observed. 
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Figure 4 | Lens regeneration in macaque models after minimally 
invasive surgery. a, Slit-lamp microscopy showed regenerating lens tissue 
grew from the peripheral to the central lens in a circular symmetrical 
pattern 2-3 months after surgery, reaching the centre at 5 months 
post-surgery. Five months after surgery, direct illumination showed that 
the visual axis remained translucent. b, Pentacam cross-sectional scanning 
showed formation of a biconvex structure 5 months after surgery (yellow 
arrowheads). Direct illumination and fundus photography showed that the 
visual axis remained transparent and the retina was clearly visible (n = 6). 


Lens regeneration in human infants 

Cataracts are a major cause of vision loss in human infants'”. Currently, 
the most commonly practiced surgical procedure involves removal of 
the cloudy lens through a large anterior continuous curvilinear capsu- 
lorhexis (ACCC), combined with either posterior laser capsulotomy or 
posterior continuous curvilinear capsulorhexis (PCCC) and anterior 
vitrectomy (Extended Data Fig. 1), which is followed by artificial lens 
implantation or postoperative aphakic eyeglasses or contact lenses'®. 
However, complications such as VAO often occur. Moreover, difficulty 
with refractive correction of developing eyes, secondary glaucoma, and 
surgery-related complications can lead to a poor outcome’. We con- 
ducted a clinical trial in paediatric cataract patients up to two years of 
age to investigate whether lenses could be regenerated in humans using 
minimally invasive surgery. 

Twelve paediatric cataract patients (24 eyes) underwent minimally 
invasive surgery to promote lens regeneration, while 25 paediatric 
cataract patients (50 eyes) in the control group received the current 
standard-of-care treatment that left them aphakic. Using slit-lamp 
microscopy, we were able to dynamically observe the process of 
in vivo lens regeneration postoperatively. The capsular openings healed 
within one month after minimally invasive surgery. Three months post- 
surgery, a regenerated transparent biconvex lens structure had formed 
(Extended Data Fig. 8c, d). No significant VAO or other complications 
were observed at 6 months post-surgery (Table 1). 

All of the eyes gained visual function when the capsular bag was 
refilled with a regenerated lens of relatively uniform density. A clear 
view of the fundus was observed in all cases with successful lens regen- 
eration (Extended Data Fig. 8c, d). The average central thickness of 
the regenerated lenses increased significantly after surgery and was 
comparable to a native lens at 8 months post-surgery (Fig. 5a, P< 0.01). 
We also used retinoscopy and ophthalmoscopy to evaluate the func- 
tion of the regenerated lenses and found that from the first week to 
8 months post-surgery, the refractive power increased significantly 
(Fig. 5b, n=24, P<0.01). 

The accommodative ability of the regenerated lenses was evaluated 
8 months after surgery using an open-field autorefractor (see Methods). 
The mean accommodative response increased to 2.5 dioptres in 


Table 1 | Comparison of lens regeneration and complications in infants who received the new surgical treatment versus the current treatment 


Current treatment New treatment 
Total patients 25 12 
Total eyes 50 24 
Regenerated lens structure 0 24 
Healing and closure of capsular openings 0) 24 

Current treatment decimal acuity New treatment decimal acuity 
(logMAR) (logMAR) 
OD Os OD OS 

Baseline 0.008 (0.001) 0.008 (0.001) 0.008 (0) 0.008 (0.001) 
1 week 0.03 (0.009) 0.03 (0.010) 0.03 (0.007) 0.03 (0.008) 
3 months (before laser) 0.02 (0.017) 0.02 (0.022) 0.05 (0.014) 0.05 (0.017) 
3 months (after laser) 0.05 (0.013) 0.05 (0.018) ~ - 
6 months 0.11 (0.034) 0.11 (0.025) 0.10 (0.038) 0.11 (0.027) 

Current treatment New treatment Mean difference (95% Cl) Pvalue 
Overall complication rate 46 (0.92) 4 (0.17) 0.75 (0.57, 0.95) <0.001 
Corneal oedema 15 (0.30) 2 (0.08) 0.22 (0.02, 0.42) 0.04 
Anterior chamber inflammation 37 (0.74) 4 (0.17) 0.57 (0.35, 0.80) <0.001 
Macular oedema 3 (0.06) 0) 0.06 (—0.04, 0.16) 0.22 
Endophthalmitis 0 0) - - 
Retinal detachment 0) 0 - - 
Ocular hypertension 9 (0.18) 0) 0.18 (0.04, 0.32) 0.03 
Visual axis opacification 42 (0.84) 1 (0.04) 0.80 (0.64, 0.96) <0.001 
Additional laser surgery 42 (0.84) 0 0.84 (0.71, 0.97) <0.001 
Anterior vitrectomy 8 (0.16) 0 0.16 (0.03, 0.29) 0.04 


Summary statistics of decimal acuity measured at each time point and complication in infants who received the new surgical technique versus the standard of care. Mean (standard deviation) 
is reported for continuous variables and count (percentage) is reported for categorical variables. Cl, confidence interval; logMAR, logarithm of the minimum angle of resolution. OD, oculus dexter 


(right eye); OS, oculus sinister (left eye). 
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Figure 5 | Functional characteristics of regenerated human lenses. 

a, Lens thickness increased significantly 6 and 8 months after surgery 
(1.9 £ 0.3 and 3.7 0.3 mm, respectively, *P < 0.01), n= 24. b, Lens 
refractive power increased significantly 6 and 8 months after surgery 
(5.1+0.5 and 19.0 + 0.6 dioptres, respectively, *P <0.01).n=24. 

c, Visual acuity improved after surgery. logMAR, logarithm of the 
minimum angle of resolution. Pairwise analysis was performed to 
compare visual acuity before and after surgery (P < 0.05) OD (oculus 
dexter, right eye) n = 25; OS (oculus sinister, left eye) n= 12, OU 

(oculus uterque, both eyes) n= 12. d, Accommodative power increased 
significantly from 1 week (control OD, control OS, OD and OS, all 
0.1+0.1 dioptres) to 8 months (control OD and control OS, 0.2 +0.1 
dioptres; OD and OS, 2.5 + 0.2 dioptres) postoperatively (*P < 0.001). 
Control OD, n = 25; control OS, n= 12; OD, n= 12; OS, n= 12. 

e, Visual acuity was measured preoperatively and at 1 week, 3 months, 
and 6 months postoperatively. The majority of eyes in the control group 
underwent additional laser capsulotomy at 3 months after surgery, 

with visual acuity measured before and after the procedure. There was 

no significant difference in visual acuity between eyes that received 
minimally invasive surgery (m = 24) and those that received the current 
surgical technique (n = 50), except at 3 months before the control group 
underwent laser capsulotomy (t-test, ***P < 0.001). Data are shown 

as mean + s.d. f, Visual axis transparency was achieved in nearly all 
cataractous infant eyes after minimally invasive surgery (95.8%). The scar 
tissue of the wound on the anterior capsule was <1.5 mm in diameter and 
located in the periphery, away from the visual axis. The scars were not 
visible unless the pupils were dilated. No disorganized tissue regeneration 
was observed. Compared with the current standard surgical method, the 
new surgical technique decreased VAO by >20-fold. 
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regenerated lenses, which was markedly improved compared to the 
0.10 dioptre increase in aphakic controls (P < 0.001). Grating acuities 
(cycles/degree) was recorded preoperatively and at each postoperative 
follow-up appointment, then converted to the logarithm of the 
minimum angle of resolution. Infant visual acuity and accommodation 
power were significantly improved postoperatively compared to the pre- 
operative baseline (P< 0.001) (Fig. 5c, d). The increase in visual acuity 
was comparable to that achieved using the current surgical method 
(Fig. 5e). Thus, visual function testing showed that the regenerated 
lenses were functional. 


Clinical outcome with new or current treatment approach 
It is well known that with the current method for paediatric cataract sur- 
gery, VAO will occur in nearly all patients weeks or months postopera- 
tively owing to the abnormal proliferation of residual LECs”° (Table 1). 
To avoid VAO, additional procedures such as polishing of the lens 
capsule, laser capsulotomy, PCCC, and anterior vitrectomy are widely 
practiced to disrupt LECs, the lens capsule on which LECs proliferate, 
and aberrant lens fibre regeneration. Although these procedures can 
decrease VAO incidence by 15%, they carry significant risk of post- 
operative inflammation and complications. In this clinical trial, our 
minimally invasive surgical method resulted in visual axis transparency in 
nearly all eyes (95.8%) (Fig. 5f, Table 1). Since the scar from the ACCC 
was <1.5mm in diameter and located in the periphery of the anterior 
capsule, it was far from the visual axis (Fig. 5f) and not visible unless 
the pupils were dilated. The preserved lens capsule remained nearly 
entirely transparent (Fig. 5f). No disorganized tissue regeneration was 
observed. Thus, compared to the current standard of care for cataract 
surgery, our new minimally invasive technique decreased VAO by 
more than 20-fold (84% versus 4.2%). Furthermore, there was an intact 
posterior capsule and lens—vitreous interface (Table 1). 

By using paired t-tests within each group, significant improvement of 
decimal acuity before and after treatment was observed with P< 0.001 
(t= 23.40, degrees of freedom (d.f.) = 49.04) in the standard-of-care 
group and P< 0.001 (t=15.05, d.f.=23.01) in the novel treatment 
group. A linear mixed-effect model using decimal acuity as the out- 
come (time: baseline, 1 week, 3 month (after surgery for control group)) 
and treatment assignment and their interaction as fixed effects yielded 
statistically insignificant result for time and treatment interaction by 
likelihood ratio test with P= 0.956 (y? =0.332, d.f.=3) (Extended 
Data Table 1a, left, and Extended Data Table Ic, left)), which indicated 
that the mean response profiles for the two groups were parallel over 
time. We then refit the linear mixed-effect model by dropping out the 
interaction term (Extended Data Table 1b, left). A likelihood ratio test 
with insignificant P value of 0.776 (y” =0.081, d.f.=1) (Extended Data 
Table Ic, left), suggested that the difference between mean decimal 
acuity in two groups was not statistically different over time (Extended 
Data Fig. 8b). In contrast, the linear mixed-effect model using decimal 
acuity as the outcome (time: baseline, 1 week, 3 month (before surgery 
for control group)) and treatment assignment and their interaction as 
fixed effects yielded statistically significant results for time and treat- 
ment interaction with P < 0.001 (vy? =47.529, d.f.=3) (Extended Data 
Table la, right, and Extended Data Table Ic, right). The non-parallel 
pattern of mean responses from two groups was largely due to vision 
loss at 3 months before laser surgery in the control group, while the dec- 
imal acuity was monotonically increased in the novel treatment group 
(Extended Data Fig. 8b). The novel treatment also shows significantly 
lower complication rate by almost every measurement, supporting the 
superiority and safety of the novel treatment (Table 1). 


Discussion 

Each year, more than 20 million cataract patients worldwide undergo 
treatment with lens extraction and artificial IOL implantation®. Despite 
the clinical success of IOLs, they have numerous limitations and poten- 
tial complications, including IOL dislocation, suboptimal biocompat- 
ibility, inadequate accommodation, and poor visual outcomes. In the 
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worst cases, irreversible blindness may result*’. Thus, a new strategy 
for treating congenital cataracts using naturally regenerated lenses is 
highly desirable. 

The current surgical procedure for paediatric cataracts impairs 
lens regeneration in several ways. First, the commonly used ACCC 
creates a relatively large opening at the centre of the anterior capsule, 
prolonging recovery time and increasing the incidence of inflammation, 
while wound healing may form scars and cause postoperative VAO. 
Second, the surgical procedure removes most of the anterior subcap- 
sular LECs, of which a subpopulation is critical for lens regeneration”. 
Third, abnormal proliferation of residual LECs causes postoperative 
VAO in many cases’, which requires opening of the posterior 
capsule, performed by either laser capsulotomy or PCCC and anterior 
vitrectomy~*, By destroying the integrity of the lens capsule and LECs, 
the current surgical procedure greatly diminishes the possibility of 
lens regeneration”®. 

In this study, we show that a new minimally invasive cataract surgery 
method preserves the integrity of the lens capsule and associated LECs, 
facilitating functional lens regeneration in animals and humans. In addi- 
tion to achieving lens regeneration in patients with congenital cataracts, 
our method also increased visual axis transparency and decreased the rate 
of complications. The small capsulorhexis opening healed quickly with a 
nearly intact lens capsule and minimal postoperative inflammation (Fig. 5f, 
Extended Data Fig. 8c-e). After surgery, there was a clear cornea, anterior 
chamber, and fundus, with no surgery-related complications. 

Our method resulted in visual axis transparency in >95% of catarac- 
tous eyes in infants, a much higher percentage than that obtained by 
traditional surgery. In the remaining patient with VAO, some degree 
of opacification of the regenerated lens occurred that correlated with 
improper healing of the anterior capsulorhexis opening and loss of LECs. 
Therefore, visual axis transparency necessitates preservation of the integ- 
rity of the lens capsule and associated LECs, as was illustrated by our 
lens regeneration study in rabbits. Furthermore, we found that BMI-1 
is required for maintenance and renewal of endogenous LECs, and that 
loss of BMI-1 leads to reduction of the proliferative capacity of LECs 
and cataract formation. The fact that LECs in adult human eyes exhibit 
increased proliferative potential after injury highlights the potential for 
LEC replenishment beyond the paediatric population. These findings 
may therefore have implications for lens regeneration in elderly patients 
with age-related cataracts. However, there are important differences 
between paediatric and adult cataracts. Hard cataracts (nuclear scle- 
rosis) in adults may require phacoemulsification, which could damage 
LECs; in addition, tissue consistency and capsular thickness/elasticity 
may pose other challenges for adult lens regeneration. Furthermore, 
differences between the regenerative capacities of paediatric and adult 
lenses may suggest a prolonged period of regeneration in adults. 

In summary, the current surgical procedure for cataract treatment 
inadvertently destroys the integrity of the lens capsule and the very 
LECs that hold the regenerative key to lens restoration. It is also asso- 
ciated with numerous side-effects and a significant risk of complica- 
tions, particularly in infants. To overcome these problems, we have 
developed a new, minimally invasive surgical method that allows 
regeneration of a functional lens with refractive and accommodative 
abilities, and with greater visual axis transparency. This new cataract 
treatment uses endogenous stem cells to replenish the human ocular 
lens, and provides a fresh paradigm for organ and tissue regeneration. 
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METHODS 


Isolation and culture of LECs. All animal studies were performed with the 
approval of the Institutional Animal Care Committees of Sun Yat-sen University, 
the University of California San Diego, West China Hospital, and the University 
of Texas Southwestern Medical Center. 

The eyeball was enucleated from a one-month-old New Zealand white rabbit 
and washed with PBS (containing antibiotics) three times. After the cornea and 
iris were removed, a small cut was made in the posterior capsule of the lens; the 
capsule with attached epithelium was removed and cut into 1 x 1 mm/? pieces. The 
pieces of epithelium were cultured in minimum essential media supplemented with 
20% FBS, NEAA, and 50j1gml! gentamicin. 

A 17-week-old human fetal eyeball was purchased from Advanced Bioscience 
Resources, Inc. (San Francisco, California). Post-mortem human eyes were 
obtained from San Diego Eye Bank. The human LECs were cultured according to 
the same methods as above. 

For in vitro differentiation, LECs were cultured on Matrigel-coated six-well 

plates or eight-well chambers. Lentoid body was formed after 21 days in mini- 
mum essential media supplemented with NEAA, 1% FBS, 100ng ml! FGF2, and 
51g ml insulin. Images of lentoid tissue were obtained using a Leica M205FA 
stereo microscope. 
Transgenic mouse study. Membrane-tomato/membrane-green (mImG)-targeted 
ROSA™™"S mice were purchased from the Jackson Laboratory (Bar Harbour, ME; 
stock no. 7576) and maintained as homozygotes. P0-3.9-GFPcre mice expressing 
an eGFP-Cre recombinase fusion protein under the control of the Pax6 lens ecto- 
derm enhancer and the Pax6 PO promoter”° were maintained in a FVB/N back- 
ground. Lineage-tracing experiments were performed by crossing the homozygous 
ROSA™"™S reporter mouse strain with the PO-3.9-GFPcre deleter strain. Eyes were 
dissected at P1, P14, and P30 and fixed overnight in 4% formaldehyde. Tissues were 
then incubated in 10% sucrose and embedded in OCT for cryo-sectioning. Frozen 
sections were washed in PBS and imaged on a Zeiss Axio Imager fluorescence 
microscope. Bmil!' mice were generated as previously described”’. Nestin-cre 
mice”® were obtained from the Jackson Laboratory. For BrdU pulses, mice were 
injected with 100mgkg~! BrdU (Sigma) dissolved in PBS, then maintained on 
drinking water that contained 1 mg ml! BrdU until sacrifice. 

For gene expression studies, lenses of Pax6P0-3.9-GFPcre mice were dissected 
under a dissecting microscope. Lens capsular bag was opened from the posterior 
surface by making three crisscross incisions. The capsular bag was opens and lens 
material extruded. GFP-positive LECs in the mid-anterior capsular area were sepa- 
rated mechanically from GFP-negative LECs in the remaining capsular areas under 
a fluorescence microscope. RNA was isolated using RNeasy Mini Kit (Qiagen). 

To image cataracts, mice were anaesthetized with Avertin, and one drop of 1% 
Mydriacyl (Alcon) was administered per eye. Eyes were immediately visualized 
in vivo using a light microscope. For histology, mice were perfused with hepa- 
rinized saline followed by 4% paraformaldehyde (PFA) in PBS. Dissected eyes 
were fixed in 4% PFA overnight, embedded in paraffin, and sectioned by the UT 
Southwestern Molecular Pathology core facility. For BrdU staining, slides were 
deparaffinized, and subjected to heat-mediated antigen retrieval (in 10 mM 
sodium citrate, pH 6.0). Slides were stained with primary mouse anti-BrdU 
(Caltag, MD5000, 1:200) overnight at 4°C. Slides were subsequently stained 
with Alexa Fluor 555-conjugated goat anti-mouse IgG1 secondary antibody (Life 
Technologies, 1:500) and 1 mg ml! DAPI (1:500) for 1h at room temperature. The 
number of BrdU-labelled cells was divided by the total number of DAPI* cells in 
a single layer of LECs. 

Lentiviral RNAi. Lentiviral shRNA targeting the human BMI1 gene 
(NCBI Reference Sequence: NM_005180.8) was purchased from Origene (TL314462), 
ShRNA targeting sequences were as follow: 5/-AATGCCATATTGGTATATGA 
CATAACAGG-3’ and 5‘-GTAAGAATCAGATGGCATTATGCTTGTTG-3’. Two 
shRNAs were used separately, and a non-effective 29-mer scrambled shRNA was 
used as a control. Lentiviral shRNA particles were prepared using shRNA lenti- 
viral packaging kit (Origene, TR30022). Viruses were harvested at 48 h and 72 h 
post-transfection. 

Western blot analysis. LECs were cultured on Matrigel-coated 3.5-mm dishes with 
lentoid formation medium for 30 days. Cells were washed twice with ice-cold PBS, 
and lysed in RIPA lysis buffer with PMSF. Protein concentration was determined by 
BCA protein assay kit. Thirty micrograms of total protein lysate was loaded onto 
10% SDS-PAGE gel and then transferred to a PVDF membrane (Millipore) at 70 V 
for 2h. The membrane was probed with the following primary antibody at 4°C 
overnight: anti-cA-crystallin (sc-22389, Santa Cruz), anti-B-crystallin (sc-48335, 
Santa Cruz), anti-y-crystallin (sc-22415, Santa Cruz) and anti-(-actin (sc-47778, 
Santa Cruz), and then incubated with HRP-conjugated anti-rabbit, anti-mouse, or 
anti-goat secondary antibody for 1h at room temperature. The immunodetection 
was visualized using a blot imaging system (Fluor Chem Q, Protein Simple) with 
ECL buffer (Millipore). 
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Lens regeneration in rabbit and macaque models. New Zealand white rabbits 
(n= 29, four rabbits died from systemic infections unrelated to surgery. The remain- 
ing 25 rabbits were used to assess regeneration), and long-tailed macaques (Macaca 
fascicularis) monkeys (n= 6) underwent minimally invasive capsulorhexis surgery. 
Only the left eye of each animal was used for experiments. Slit-lamp biomicroscopy 
and photography were performed at different time points to monitor lens regen- 
eration. Rabbits were euthanized at day 1, day 7, and one month after surgery, 
and the treated eyes were enucleated. The lenses were harvested for histologic 
analysis using haematoxylin and eosin staining. For the macaques, enucleation of 
the treated eye was performed 4 months post-surgery and the lenses were harvested 
for the same histologic examinations. The eyes were fixed, paraffin-embedded, and 
sectioned at 5jum through the cornea, pupil, and optic nerve with the lens in situ. 
Real-time PCR. RNA was isolated from rabbit LECs, mature lens fibre cells and 
LECs in P0-3.9-GFPcre mice using an RNeasy Mini Kit (Qiagen) and subjected 
to on-column DNase digestion. cDNA was synthesized using a Superscript II 
reverse transcriptase kit according to the manufacturer’s instructions (Invitrogen). 
Quantitative PCR was performed via 40 cycle amplification using gene-specific 
primers (Supplementary Table 1) and Power SYBR Green PCR Master Mix on a 
7500 Real-Time PCR System (Applied Biosystems). Measurements were performed 
in triplicate and normalized to endogenous GAPDH levels. The relative fold change 
in expression was calculated using the AAC, method (C; values <30). 
Immunofluorescence and laser confocal microscopy. Rabbit LECs were fixed in 
4% PFA for 20 min, then permeabilized with 0.3% Triton X-100-PBS for 10 min 
and blocked in PBS solution containing 5% BSA, followed by an overnight incuba- 
tion in primary antibodies at 4°C. After three washes in PBS, cells were incubated 
with secondary antibody for 1 h in room temperature. Cell nuclei were counter- 
stained with DAPI. 

The following antibodies were used: goat anti-Sox2 polyclonal antibody (Santa 
Cruz), rabbit anti-PAX6 polyclonal antibody (PRB-278P, Covance), mouse anti- 
Bmil antibody (ab14389, Abcam), and mouse anti-Ki67 monoclonal antibody 
(550609, BD Sciences). The secondary antibodies, Alexa Fluor 488- or 568- 
conjugated anti-mouse or anti-rabbit IgG (Invitrogen), were used at a dilution 
of 1:500. Images were obtained using an Olympus FV1000 confocal microscope. 
BrdU labelling of LECs in humans. We used BrdU labelling to identify and quan- 
tify proliferating LECs from human cadaver eyes. Whole-mount human lens cap- 
sules were pulsed with BrdU and then stained with an antibody against BrdU 
to determine the distribution and density of proliferating LECs. In brief, within 
12-24h after death, lenses from post-mortem donor eyes were obtained from 
the Eye Bank of Zhongshan Ophthalmic Center in Guangzhou, China. Twelve 
lenses in total from six donors were used for the experiment. A small puncture 
injury was made on the anterior surface of a post-mortem human lens using a 
30-gauge needle. The lenses were cultured at 37°C in Dulbecco modified Eagle 
medium (DMEM) supplemented with 10% FBS in a humidified incubator with 
5% CQ. The contralateral lens from the same donor was treated under the same 
conditions but did not receive a puncture injury and was used as a control. To label 
the proliferating LECs, both groups of lenses were incubated in 100jgml~! BrdU 
(Sigma-Aldrich) 24h after the puncture injury. The lens was then removed from 
the capsular bag, and the lens capsules were fixed in 4% formaldehyde and sub- 
jected to BrdU staining using a standard immunohistochemistry protocol accord- 
ing to the manufacturer's instructions (CST, Boston, Massachusetts). Images were 
taken using a Carl Zeiss microscope (Jena, Germany). 

Study design, execution, and oversight of clinical trial in humans. This study was 
approved by the institutional review board of the Zhongshan Ophthalmic Center 
(ZOC). Informed written consent was obtained from the parents or guardians of 
the infants before enrolment, and the tenets of the Declaration of Helsinki were 
followed throughout the study. The study was conducted in accordance with an 
international guideline and protocol for visual function measurements in paedi- 
atric cataract surgery and a protocol of the Childhood Cataract Program of the 
Chinese Ministry of Health (CCPMOH) and had an independent data and safety 
monitoring board of ZOC-CCPMOH. 

Description of current surgical method for cataract extraction. The current 
standard-of-care treatment for paediatric cataract involves removal of the catarac- 
tous lens through a relatively large opening using anterior continuous curvilinear 
capsulorhexis (ACCC, about 6 mm in diameter, Extended Data Fig. 1), followed 
by cataract extraction and artificial lens implantation or placement of postoper- 
ative aphakic eyeglasses/contact lens in paediatric cataract patients younger than 
two years. Some patients underwent additional posterior continuous curvilinear 
capsulorhexis (PCCC) and anterior vitrectomy. 

Establishment of a minimally invasive capsulorhexis surgery method to 
preserve LECs. We established a new capsulorhexis surgery method to facilitate 
lens regeneration (Fig. 3a). First, we decreased the size of the capsulorhexis opening 
to 1.0-1.5mm in diameter. This results in a minimal wound of about 1.2 mm? 
in area, which is only about 4.3% the size of the wound created by the current 
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method. Second, we moved the location of the capsulorhexis to the peripheral area 
of the lens instead of the central area. A 0.9mm phacoemulsification probe was 
used to remove the lens contents and/or cortical opacities. These changes provide 
significant advantages. First, it considerably reduces the size of the injury, which 
resulted in a lower incidence of inflammation and much faster healing. Second, it 
moves the wound scar away from the central visual axis to the periphery, leading 
to improved visual axis transparency. Third, it preserves a nearly intact transparent 
lens capsule and layer of LECs, which have regenerative potential and are critically 
required for the regeneration of a natural lens. 

Clinical trial of minimally invasive lens surgery in human infants with 
congenital cataract. The clinical trial is an open label, randomized controlled trial in 
a study population of paediatric cataract patients (age: 0-2 years). Except the trial 
participants, all other parties (care providers, outcome assessors) were blinded to 
treatment allocation. A clinical trial consort flowchart is listed in the Extended 
Data Fig. 8a. Paediatric patients were enrolled accordingly inclusion and exclusion 
criteria below (ClinicalTrials.gov identifier: NCT01844258). Inclusion criteria were 
the following: infants were <24 months old, and diagnosed with bilateral uncompli- 
cated congenital cataract with an intact non-fibrotic capsular bag. Exclusion criteria 
included preoperative intraocular pressure (IOP) >21 mm Hg, premature birth, 
family history of ocular disease, ocular trauma, or other abnormalities, such as 
microcornea, persistent hyperplastic primary vitreous, rubella, or Lowe syndrome. 
In total, twelve paediatric cataract patients (24 eyes) received the new minimally 
invasive lens surgery (Table 1). Twenty-five paediatric cataract patients (50 eyes in 
total) were enrolled as the control group to receive the current standard surgical 
treatment (Extended Data Fig. 8a). Bilateral eye surgeries of the same patient were 
conducted during the same operation session. 

We defined the incidence of corneal oedema as a >5% increase in central cor- 
neal thickness one week post-surgery, and the incidence of severe anterior cham- 
ber inflammation as a Flare value >10 evaluated by Pentacam system (OCULUS, 
Germany) and Laser flare meter (KOWA FM-600, Japan). Early-onset ocular 
hypertension was identified as IOP >21 mm Hg by Tonopen (Reichert, Seefeld, 
Germany) within one month after surgery. Macular oedema was identified by fun- 
dus OCT (iVue, Optovue, Germany) as an increase in central macular thickness 
>10% one week post-surgery. When indicated, VAO, defined by visual decline 
and the degree to which the fundus was obscured, was treated with YAG laser 
capsulotomy at follow-up. 

Compared to infants operated on using our new surgical technique, infants 

who received the traditional technique had a higher incidence of anterior cham- 
ber inflammation one week after surgery, early-onset ocular hypertension, and 
increased VAO (Table 1). However, in the group treated with our new method, a 
transparent regenerated biconvex lens was found in 100% of eyes three months 
after surgery, while no regenerated biconvex lenses formed in the group treated 
with the standard technique. In addition, 100% of the capsular openings healed 
within one month after surgery in the experimental group, but no capsular 
openings healed in the control group. 
Evaluation of paediatric visual acuity. Testing equipment included a set of Teller 
Acuity Cards (Vistech Consultants, Dayton, Ohio). The set of cards consists of 
15 cards with gratings ranging in spatial frequency from 0.32 to 38 cycles per cm, 
in half-octave steps, and one blank grey card. A 4-mm peephole in each card allows 
the tester to view the child’s face through the card during testing. Test distance 
was kept constant by use of an aid to measure the distance from the child’s eyes to 
the card throughout testing. For 38 cm, the aid was the distance measured from 
the tester’s elbow to a specific knuckle on the tester’s hand, and for 55 cm, the aid 
was the length (55cm) of the Teller Acuity Card. Testers were instructed to hold 
the cards without wrapping their fingers around the front side of the card, as this 
may attract the child’s attention. Testers presented the cards directly in front of 
the child and observed the child either over the top of the card or through the 
peephole in the card. 

During each acuity test, a masked visual acuity examiner was aware that the 
gratings were arranged in order from lower to higher spatial frequencies in half- 
octave steps, but were masked to the absolute spatial frequency of the grating 
on each card. The subset of spatial frequencies used for each test was selected 
according to a pseudorandom order from among three possible subsets of spatial 
frequencies for the subject’s age group. All three subsets for each age group included 
spatial frequencies known to be well above the threshold for that age group. To 
keep the visual acuity examiner masked to the absolute spatial frequency, the visual 
acuity examiner was not permitted to look at the front of the card to confirm the 
location of the grating. Instead, the visual acuity examiner asked an assistant to 
confirm the location of the grating on the card, after the visual acuity examiner 


had shown a card to the subject enough times to assess whether or not the subject 
could detect the grating. A clinical examiner was masked to the acuity results and 
the assigned patient group. Acuity was scored as the spatial frequency of the finest 
grating and was converted to log values before data analysis. 

Measurement of lens refractive power. We used a handheld auto-refractometer 
(PlusoptiX A09, OptiMed, Sydney, Australia) to evaluate the function of the regen- 
erated lenses according to the manufacturer's methods. 

Statistical analysis. Descriptive statistics was provided for the primary and sec- 
ondary endpoints measured by intervention groups at each time point. Mean and 
standard deviation was reported for continuous variables and count and percent- 
age is reported for categorical variables. To assess whether the primary outcome, 
decimal acuity, was significantly improved within each group, we performed the 
pre-post comparison between decimal acuity measured at baseline and study end- 
point using paired t-tests. Normality of the data was checked and non-parametric 
alternatives, Wilcoxon signed-rank test is considered if the assumption was severely 
violated. To evaluate whether the mean response profiles in two groups were sim- 
ilar, we used the linear mixed-effect model taking account for the within-subjects 
correlation. The baseline decimal acuity was not adjusted by the model due to 
the homogeneity of this measurement as shown in the summary statistics. As 
the standard-of-care approach requires laser surgery at 3 months while the novel 
treatment does not, we fit two models using before and after laser surgery data, 
separately, to demonstrate the superiority of the novel approach. In each model, 
the outcome is the decimal acuity measured at four time points: baseline, 1 week, 
3 months (before or after laser surgery) and 6 months; time (baseline as the refer- 
ence level), treatment assignment and their interaction are the fixed effects; and 
patient is the random effect. Significant associations are identified using likelihood 
ratio test (LRT) by comparing models with and without a fixed effect. A linear 
mixed-effect model is fit again by dropping out the insignificant fixed effect until 
the final model is selected. A contrast test is performed when necessary. For the 
secondary aim, we compared the proportions of each condition of complications 
between two groups. We assumed the occurrence of complications for eyes from 
the same patient were independent. The mean difference and its 95% confidence 
interval was reported. A two-proportion z-test was used with the nonparametric 7 
test as alternative if the normality assumption was violated. All tests were two-sided 
and a Pvalue less than 0.05 is considered to be statistically significant. 
Evaluation of accommodative response. Accommodative response was measured 
by an open-field autorefractor (SRW-5001K; Shin-Nippon, Tokyo, Japan), which 
allows targets to be viewed at any distance. The paediatric patients were positioned 
for autorefractor measurement with assistance from their parents. The patients 
were guided to fixate binocularly at a near target (33 cm, 5 x 5 array of smiley 
faces of N10 size) and a far target (3m, 5 x 5 array of smiley faces of N10 size) by 
a trained and certified investigator or study coordinator. The measurements from 
non-cycloplegic autorefraction were performed three times at each target distance 
by the same trained and certified investigator throughout the study, in order to 
maintain accuracy and consistency throughout the trial. Measurements were taken 
in the same quiet environment with consistent room illumination to diminish 
the influence of distracting factors and to maintain subjects’ concentration. The 
spherical equivalent refractive value (SER) was recorded for each measurement and 
the mean value was calculated for evaluation of an accommodative response. The 
value of accommodative response was the difference between SER values for the 
near and the far target. We also used dynamic retinoscopy to measure the infants’ 
accommodation”*-*!, In brief, we recorded a lens dioptre value using retinoscopy 
when a patient was guided to fixate on a target 3 m away. Then another lens dioptre 
value was recorded when the target was moved closer, at a distance of 33 cm from 
the eyes. The difference between these two measurements was used to evaluate 
lens accommodative power. 
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2 years after surgery 


B 4 years after surgery 


Current Pediatric Cataract Surgery 
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PCC, < Lens X . 
4mm Capsule _ «-Donut-like 
Regenerated Lens 
Extended Data Figure 1 | Surgical methods and lens regeneration for years post-surgery, indicating continued growth of the regenerated lens. 
congenital cataract. a, b, Slit-lamp photography of ‘doughnut-like’ lens There was also the complication of iridolenticular synechiae. c, Schematic 
regeneration at different time points after treatment using the current diagrams of the current surgical method for paediatric cataracts: the 
surgical method. Two years after surgery (a), the transparent regenerated currently practiced paediatric ACCC creates an opening 6 mm in diameter 
lens tissue contained the sealed capsular opening with an opaque white at the centre of the anterior capsule, removing the LECs underneath it and 
scar at the centre. The regions between the dashed circles indicated by leaving a relatively large wound area of 28 mm”. The scars formed often 
the red arrows are the regenerated lens tissues. Four years after surgery cause postoperative VAO. Additionally, PCCC and anterior vitrectomy are 
(b), the capsular opening was constricted compared to that seen at two commonly performed at follow-up visits. 
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umber of Brdu* Cells 


Extended Data Figure 2 | BrdU pulse labelling of human LECs. 

a, Whole mount of a human lens capsule showing BrdU cells (brown) 

by enzymatic immunohistology and diaminobenzidine staining. b, High- 
magnification images of human donor lenses showing BrdU* LECs. c, Bar 
graph showing quantification of BrdU* cells. There was an age-dependent 
decrease in the number of BrdU* cells (8 months, 39.9 + 8.1; 30 years, 
20.3 + 7.3 and 40 years, 5.9 + 2.9; 8 months versus 40 years, *P < 0.05). 

Six randomly chosen fields of each capsule were used for analysis, four 
samples in each group, (n = 24 fields, chosen over four samples). d, High- 
magnification images of whole-mount staining of human lens capsules 
with or without injury showed a marked increase in the number of BrdU* 


(Fold) 


g 
: 
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cells after injury. e, Bar graph showing quantification of BrdU* cells. The 
contralateral eyes from the respective donors were used as controls. There 
was a significant increase in number of BrdU* cells. No injury, 1.5 + 1.2; 
after injury, 18.4 + 4.2; fold change after injury, 11.3 + 2.5; *P < 0.05. Six 
randomly chosen fields within the germinative zone of each capsule were 
used for analysis, five samples in each group (n = 30 fields, chosen over 
five samples). Data shown as means + s.d. f, Cultured human fetal LECs 
were positive for BMI-1 (green, right upper panel); co-staining of PAX6 
(red) and Ki67 (green), middle panels; co-staining of SOX2 (red) and Ki67 
(green), lower panels. g, Co-staining of PAX6 (red) and SOX2 (green) of 
human fetal LECs. All scale bars, 100j1m. 
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Extended Data Figure 3 | Conditional deletion of Bmi-1 led to decrease 
in Pax6* and Sox2* cells and cataract formation. A, Loss of Bmi-1 
reduced the Pax6* and Sox2* LECs population. a, Representative images 
of haematoxylin and eosin-stained lens sections from Bmil“"' control 
mice and Nestin-cre;Bmil/' mice. b, Representative images of Bmi-1 (red) 
staining in LECs. c, Pax6 (red) and Sox2 (green) immunostaining. 

d, Percentage of Pax6t (Bmil™", 88.5 + 2.9%; Nestin-cre;BmilM!, 
2.4+2.3%) and Sox2* (Bmil!", 82.7 + 3.9%; Nestin-cre;Bmi lM, 

4.9 + 1.5%) cells (n=5 mice; 5 sections counted per mice, for a total of 

25 sections across 5 mice), *P< 0.001. Data are shown as mean +s.d. 

B, Conditional deletion of Bmil led to reduced LEC proliferation. The 
percentage of BrdU* LECs per eye is shown (2M: Bill, 2.6 + 0.9%; 
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Nestin-cre;Bmil!""", 3.0 + 0.4%; n=4 mice. 7M: Bmil", 1.5 +0.2%; 


Nestin-cre;Bmil!", 0.6 + 0.4%; n=6 mice. 12M: Bmil", 1.8 + 0.6%; 
Nestin-cre;Bmil, 0.2 40.2%; n=8 mice), two sections counted per eye. 
Statistical significance was assessed using a two-tailed Student's t-test. 

*P <0.05. Data are shown as mean +s.d. C, Nestin (green) staining is 
shown in E13.5, E18.5, and 2-month-old wild-type mice. All scale bars, 
100 1m. D, Representative images of lenses from Nestin-cre;Bmil! and 
Bmilf! control mice. a, Cataracts are evident in 7- and 12-month-old 
Nestin-cre;Bmil!' mice (arrow). b, Deletion of Bmi-1 at 6 weeks of age 
with Nestn-creER did not recapitulate the cataract phenotype 10 months 
after tamoxifen treatment. Haematoxylin and eosin-stained sections of the 
same eyes are also shown. All scale bars, 100 1m. 
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Extended Data Figure 4 | Loss of BMI-1 decreased the proliferative 
ability of LECs. a, Phase-contrast photographs of human LECs (upper 
panels) and quantification of Ki67* proliferating human fetal LECs upon 
BMI1 knockdown (shBMI1) compared to controls (two shRNAs gave 
similar results; n =5, P< 0.05). Data shown as mean + s.d. Blue indicates 
DAPI staining. b, Loss of BMI-1 did not significantly affect expression of 
LEC or lens fibre cell makers in LECs. BMIJ was reduced by 3.3-fold 
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(all n=3, P< 0.05); gene expression changes of LEC markers were: 
1.3-fold increase (PAX6), 1.1-fold increase (SOX2), 1.3-fold increase 
(C-MAF) and 1.1-fold increase (E-cadherin); gene expression changes of 
differentiated lens fibre cell markers were: 1.6-fold increase (Filensin), 
0.9 fold increase (CP49) and 1.4-fold decrease (CRYBA2). Two different 
shRNAs gave similar results; n =5, P< 0.05. Data shown as mean + s.d. 
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Extended Data Figure 5 | Higher expression levels of Bmil, Sox2 

and Ki67 in Pax6+ LECs. a, Pax6é—GEP* LECs were observed at the 
germinative zone. Left panel, a section of lens of a Pax6P0-3.9-GFPcre 
mouse at P1. Middle and right panels, higher magnification of the framed 
area in the left panel. Blue indicates DAPI staining. b, Upper panel, bright- 
field photograph showing flat-mount preparation of a lens capsule of a 
Pax6P0-3.9-GFPcre mouse at 6 months; lens capsule materials between 
two red circles were dissected to enrich Pax6—GFP* LECs. Lower panel, 


PAX6 


Ki67 BMI-1 SOX2 


fluorescence image of GFP* LECs from the framed area in the upper 
panel. AC, anterior capsule; PC, posterior capsule. c, Comparison of gene 
expression levels in Pax6-GFP* LECs versus GFP” LECs in anterior lens 
capsule in 6-month-old mice, increased expression of the following genes 
were observed: 10.1-fold in Pax6 (P< 0.005), 8.2-fold in Ki67 (P < 0.05), 
4.3-fold in Bmil (P< 0.05), and 2.6-fold in Sox2 (P< 0.05), all n=5. Data 
shown as mean + s.d. 
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Extended Data Figure 6 | Lens regeneration surgery in rabbits. arrow). d, A blunt needle was used to inject balanced salt solution for 

a, A 3.2-mm keratome was used to make a limbus tunnel incision at the hydrodissection of the cortex from the anterior capsule. e, The cortex was 
11-12 oclock position into the anterior chamber. b, The capsular opening removed using a phacoemulsification device. f, The remaining cortex was 
was created by a capsulorhexis needle. c, A 1-2 mm diameter anterior removed using irrigation and aspiration. h, An elbow I/A handle was used 
capsulotomy was performed using the anterior continuous curvilinear to clear the equatorial cortex. i, j, The limbus wound was sutured with an 
capsulorhexis (ACCC) technique near the capsular opening area (yellow interrupted 10-0 nylon suture. The wound was found to be watertight. 
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Extended Data Figure 7 | Lens regeneration in rabbits. a, Haematoxylin 
and eosin staining of regenerated lenses at different time points after 
surgery. At postoperative day 1, a monolayer of LECs between the anterior 
and posterior capsules was visible (arrowheads). At postoperative day 4, 
LECs proliferated and covered the posterior capsule. At postoperative 

day 7, LECs in the posterior capsule began to elongate and differentiate. 

b, At postoperative day 28, LECs in the posterior capsule further 
elongated, forming primary lens fibres. c, Transparency and shape of 
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regenerated lenses in rabbits. Upper panel, slit-lamp photography of 

a regenerated lens at different time points after surgery. Lower panel, 
schematic diagram of slit-lamp photographs in the upper panel. At day 1 
after surgery, the capsular opening was clearly seen in the peripheral 
anterior capsule, and the area of LEC loss during surgery is indicated. 

At 7 weeks after surgery, loss of LECs led to adhesion between the anterior 
and the posterior capsule and inhibition of lens regeneration in this area. 
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Extended Data Figure 8 | Human lens regeneration. a, A clinical trial 
consort flowchart. b, Comparison of visual acuity mean response profiles 
in two groups. A non-parallel pattern of mean responses between two 
groups was observed largely due to the vision loss at 3 months before 
laser surgery in the control group (left panel), whereas a parallel pattern 
of mean responses between two groups was observed using time points 
including 3 months after laser surgery (right panel); n = 25 control, 

n= 12 experimental. Data are shown as mean + s.d. ¢, Lens thickness 
increased after surgery. Pentacam showed that 3 months after surgery, 
the regenerating lens tissue grew from the periphery of the capsular bag 
to the centre. The sealed capsular bag was only partially filled, appearing 
spindle-shaped on cross-sectional scan. The fundus was clearly visible on 
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ophthalmoscopy. Arrowheads indicate the regenerated lens structure. 

d, Six months after surgery, the capsular bag was filled with regenerated 
lens tissue and appeared biconvex on cross-sectional scan by Pentacam. 
The anterior—posterior capsular adhesion disappeared. The fundus 

could be seen clearly using an ophthalmoscope with an 18-dioptre lens. 
e, Minimally invasive capsulorhexis preserved LECs for lens regeneration 
in human infants. Top panel, slit-lamp exam demonstrating human infant’s 
eye visual axis transparency 6 months after minimally invasive surgery 
compared to baseline (before cataract surgery). Bottom panel, retro- 
illumination demonstrating the reduced size of the capsulorhexis 

(white arrowheads). 
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Extended Data Table 1 | Clinical Outcome Analysis 
Extended Data Table 1a. Linear mixed-effect model with decimal acuity as outcome; time, treatment and their interaction as fixed effects; and 


patient as random effect. 


baseline, 1 week, 3 months after surgery and 6 months 


Estimate 
(Intercept) 0.008 
1 week 0.022 
3 months 0.042 
6 months 0.099 
Trmt (Novel) 0 
1 week*Trmt -0.003 
3 months*Trmt 0 
6 months*Trmt -0.001 
Random effect 0.008 
-2logL 1509.948 


Std.Error Z test 
0.003 2.97 
0.003 6.926 
0.003 13.409 
0.003 31.722 
0.005 -0.024 
0.005 -0.494 
0.005 -0.036 
0.005 -0.093 


Pr(>|Z]) 
0.003** 
<.001*** 
<.001*** 
<.001*** 
0.981 
0.621 
0.971 
0.926 
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baseline, 1 week, 3 months before surgery and 6 months 


Estimate 
(Intercept) 0.008 
1 week 0.022 
3 months 0.011 
6 months 0.099 
Trmt(Novel) 0 
1 week*Trmt -0.003 
3 months*Trmt 0.031 
6 months*Trmt -0.001 
Random effect 0.01 
-2logL 1497.237 


Std.Error 
0.003 
0.003 
0.003 
0.003 
0.005 
0.006 
0.006 
0.006 


Z test 

2.781 

6.859 

3.35 

31.417 

-0.023 

-0.49 

5.619 

-0.092 


Pr(>|Z|) 

0.005** 
<.001*** 
<.001*** 
<.001*** 
0.982 

0.624 
<.001*** 
0.927 


Extended Data Table 1b. Linear mixed-effect model with decimal acuity as outcome; time and treatment as fixed effect; and patient as random 


baseline, 1 week, 3 months after surgery and 6 months 


effect. 
Estimate 
(Intercept) 0.008 
1 week 0.021 
3 months 0.042 
6 months 0.099 
Trmt(Novel) -0.001 
Random effect 0.008 
-2logL 1536.077 


Std.Error Z test 
0.003 3.346 
0.003 8.126 
0.003 16.374 
0.003 38.731 
0.004 -0.276 


Pr(>|Z]) 
<.001*** 
<.001*** 
<.001*** 
<.001*** 
0.783 


baseline, 1 week, 3 months before surgery and 6 months 
Estimate 
0.006 
0.021 
0.021 
0.099 
0.007 
0.009 
1476.647 


(Intercept) 

1 week 

3 months 

6 months 
Trmt(Novel) 
Random effect 
-2logL 


Std.Error 
0.003 
0.003 
0.003 
0.003 
0.004 


Extended Data Table 1c. Likelihood ratio test of fixed effects based on the analysis of response profiles 


baseline, 1 week, 3 months after surgery and 6 months 


Time*Treatment 
Time 
Treatment 


DF 
3 
3 
1 


Chi-Squared P-value 
0.322 0.956 
532.308 <.001*** 
0.081 0.776 


Time*Treatment 
Time 
Treatment 


DF 


3 
1 


Chi-Squared 
47.529 
495.562 
3.089 


Z test 
2.107 
7.347 
7.313 

35.017 
1.746 


baseline, 1 week, 3 months before surgery and 6 months 


P-value 
<,001*** 
<.001*** 

0.079 


Pr(>[Z]) 
0.035* 
<.001*** 
<.001*** 
<.001*** 
0.081 


a, b, Linear mixed-effects model (decimal acuity as outcome; time, treatment and their interaction as fixed effects; and patient as random effect). c, Likelihood ratio test of fixed effects based on the 


analysis of response profiles. 
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Observing cellulose biosynthesis and 
membrane translocation in crystallo 


Jacob L. W. Morgan!*, Joshua T. McNamara!*, Michael Fischer?+, Jamie Rich?+, Hong-Ming Chen’, Stephen G. Withers? & 


Jochen Zimmer! 


Many biopolymers, including polysaccharides, must be translocated across at least one membrane to reach their site of 
biological function. Cellulose is a linear glucose polymer synthesized and secreted by a membrane-integrated cellulose 
synthase. Here, in crystallo enzymology with the catalytically active bacterial cellulose synthase BcsA-BcsB complex 
reveals structural snapshots of a complete cellulose biosynthesis cycle, from substrate binding to polymer translocation. 
Substrate- and product-bound structures of BcsA provide the basis for substrate recognition and demonstrate the stepwise 
elongation of cellulose. Furthermore, the structural snapshots show that BcsA translocates cellulose via a ratcheting 
mechanism involving a ‘finger helix’ that contacts the polymer’s terminal glucose. Cooperating with BcsA’s gating loop, 
the finger helix moves ‘up’ and ‘down’ in response to substrate binding and polymer elongation, respectively, thereby 
pushing the elongated polymer into BcsA’s transmembrane channel. This mechanism is validated experimentally by 
tethering BcsA’s finger helix, which inhibits polymer translocation but not elongation. 


Cellulose is an abundant structural cell component produced by many 
organisms, including bacteria, vascular plants and animals!~*, It is a 
linear polymer of glucose molecules joined between their C1 and C4 
carbons. Cellulose is synthesized by membrane-integrated glycosyl- 
transferases (GTs) that contain 6 to 8 transmembrane helices (TMHs) 
as well as an intracellular catalytic GT domain®. These enzymes poly- 
merize UDP-activated glucose (UDP-Glc)”* into chains thousands 
of glucose units long’ and translocate the polymer across the plasma 
membrane, through a pore formed by their own transmembrane 
region!”. 

Cellulose is also a common biofilm component”"! where it is syn- 
thesized and secreted via an inner and, in Gram-negative bacteria, 
outer membrane-spanning cellulose synthase complex’. At the inner 
membrane, the catalytic BcsA and membrane-anchored, periplasmic 
BcsB subunits form a complex sufficient to synthesize and translocate 
cellulose’, while transport across the outer membrane probably occurs 
through the BesC subunit’®’. 

Processive glycosyltransferases, including chitin, alginate and cellu- 
lose synthases, transfer the glycosyl moiety from a nucleotide-activated 
sugar (donor) to a specific hydroxyl group of the growing polysaccha- 
ride chain (acceptor) by a nucleophilic $y2-like substitution reaction", 
thereby forming an elongated polymer and nucleoside diphosphate 
as reaction products. A processive mechanism requires that the elon- 
gated polymer is translocated after each glycosyl transfer, such that the 
polymer’s newly added sugar unit becomes the acceptor in a subse- 
quent reaction. Because all known processive glycosyltransferases are 
transmembrane channel-forming enzymes!*!”, the translocation of 
the polymer into the transmembrane channel between catalytic steps 
also gives rise to secretion. 

Previous structural and functional analyses of the Rhodobacter 
sphaeroides BcsA-BcsB complex containing a nascent cellulose poly- 
mer revealed the architecture of the active site, its close association with 
the transmembrane channel, as well as the coordination of cellulose 
within the channel”. In bacteria, cellulose biosynthesis is activated by 


the signalling molecule cyclic-di-GMP (c-di-GMP)'§, a potent biofilm 
inducer and allosteric activator of BcsA'’. Binding of the activator to 
BcsAss carboxy-terminal PilZ domain allows a “gating loop” to either 
insert into the catalytic pocket during substrate binding or to retract 
from it to release the UDP product””. 

Cellulose synthases contain a short helix within the GT domain, 
termed “finger helix””°. The amino terminus of the finger helix con- 
tacts the polymer’s acceptor glucose via an invariant “TED” motif, of 
which the aspartic acid (D) probably facilitates the deprotonation of 
the acceptor C4 hydroxyl during catalysis!°”°. 

Crystal structures of the catalytically inactive “resting” state of 
BcsA-BcsB (in the absence of c-di-GMP)"° and a c-di-GMP-activated 
structure”® provided important insights into the architecture and 
function of processive glycosyltransferases. Here we used in crystallo 
enzymology to obtain structural snapshots of a complete cellulose 
biosynthesis reaction cycle, providing structures of substrate- and 
product-bound states, and delineating the mechanism by which the 
elongated glucan is translocated into BcsA‘s transmembrane channel. 


BcsA elongates the chain one glucose unit at a time 
The previously determined c-di-GMP-activated BcsA-BcsB struc- 
ture’’ contains a nascent cellulose polymer 18 glucose molecules long 
whose non-reducing terminal glucose unit rests at the entrance to 
BcsA’s transmembrane channel, which is marked by the invariant 
Trp383 of the “QxxRW” motif?°"!. In this state, BcsA’s finger helix is 
in an ‘up’ position where it points towards the entrance of the trans- 
membrane channel, thereby positioning Asp343, the putative catalytic 
base, near the C4 hydroxyl of the polymer’s terminal sugar (Fig. 1a). 
Because BcsA’s active site is empty and the gating loop is retracted 
from it, this structure represents a state in which the enzyme is poised 
to initiate a new cycle of chain elongation, hereafter referred to as the 
‘post-translocation state’. 

Notably, BcsA-BcsB is catalytically active in crystallo. Incubating 
BcsA-BesB crystals with UDP-Glc in the absence of Mg?* (to slow 


1University of Virginia School of Medicine, Center for Membrane Biology, Molecular Physiology and Biological Physics, 480 Ray C. Hunt Drive, Charlottesville, Virginia 22908, USA. @Department of 
Chemistry, University of British Columbia, 2036 Main Mall, Vancouver, British Columbia V6T 1Z1, Canada. +Present Addresses: Sandoz GmbH, Biochemiestrasse 10, A-6250 Kundl, Austria (M.F.); 
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down the reaction) results in the extension of the polymer’s electron 
density by one glucose unit (Fig. 1a). This demonstrates that cellulose 
elongation occurs via a stepwise addition of glucose units and that 
Trp383 of the QxxRW motif indeed forms the acceptor-binding site. 
The elongated polymer points straight into the catalytic pocket, similar 
to its position in the recently determined resting state of BcsA!”. 


BesA’s finger helix resets upon polymer extension 
Processive cellulose biosynthesis requires that the elongated polymer 
is translocated after each elongation cycle and the above described 
in crystallo cellulose extension demonstrates that glycosyl transfer and 
polymer translocation are separate steps. 

To identify whether the extended cellulose translocates spontane- 
ously over time, we extended the polymer in crystallo as described 
above, then diluted the substrate 65-fold, and incubated the crystals 
overnight before harvesting. Under these conditions, the density for 
the extended polymer continues to protrude into the catalytic pocket, 
suggesting that this state is stable in the absence of substrate (Fig. 1a). 
Notably, after extending the cellulose polymer, BcsA’s finger helix shifts 
to a ‘down position, such that Thr341 and Asp343 of its TED motif 
again form hydrogen bonds with the polymer’s terminal glucose unit 
(Fig. 1a). 


330 | NATURE | VOL 531 |17 MARCH 2016 


Figure 1 | In crystallo cellulose biosynthesis. 
BcsA is shown as a surface with the 
transmembrane and GT regions coloured grey 
and beige, respectively. BcsB is shown as a grey 
cartoon. The cellulose polymer is shown as 
cyan sticks, BcsA’s finger helix and gating loop 
are shown as a cartoon coloured yellow and 
green, respectively. Trp383 is represented in 
grey sticks as a marker for the transmembrane 
channel entrance. a, Left, organization of 

the BcsA-BcsB complex and formation of a 
channel for the translocating polymer. Right, 
the pre-translocation state of BcsA. Unbiased 
Sigma-A weighted F, — F, difference electron 
densities contoured at 4 and 30 are shown as 
green and magenta meshes for the nascent 
cellulose chain and finger helix, respectively. 

b, Translocation of cellulose. Crystals described 
in a were subsequently soaked with UDP/Mg?*. 
UDP is shown in sticks and coloured violet 

for the carbon atoms and Mg?" is shown as a 
green sphere. c, Insertion of the gating loop into 
the active site is incompatible with the ‘down’ 
position of the finger helix, probably owing to 

a clash between Ile340 and the gating loop’s 
backbone. The retracted gating loop and finger 
helix in the ‘down’ position are both shown 

as blue cartoons. d, Cellulose biosynthesis by 
BcsA 1340 mutants. Ile340 was replaced with 
the indicated residues and in vitro cellulose 
biosynthesis was performed in IMVs as 
described’. All activities are represented relative 
to the wild-type (WT) activity, and error bars 
represent standard deviations from 3 replicates. 
Inset, western blot analysis of the IMVs used, 
showing equal expression levels of all BcsA 
mutants. 


The finger helix movement is accompanied by retraction of a small 
loop, which contains Phe335, from a hydrophobic pocket beneath the 
finger helix (Fig. 1a). The resulting ‘pre-translocation’ state of BcsA- 
BcsB contains an empty catalytic pocket, a gating loop retracted from 
the active site (Extended Data Fig. 1), an extended polymer, and a 
downward pointing finger helix (Fig. 1a). 


Coupled movement of gating loop and finger helix 

The observed downward movement of BcsA’s finger helix in response 
to cellulose elongation suggests its role in translocation. If crystals 
containing the above described pre-translocation step are soaked with 
UDP/Mg’*, UDP binds and the gating loop inserts into the catalytic 
pocket (Fig. 1b). Additionally, the finger helix returns to the ‘up’ posi- 
tion and the density for the polymer’s newly-added glucose unit disap- 
pears (Fig. 1b), suggesting its translocation into the channel. 

To confirm that the elongated polymer is indeed translocated 
in crystallo, we extended the polymer by one unit with the chain- 
terminating analogue 6-thio-galactose, whose location can be unam- 
biguously determined based on anomalous X-ray scattering of its sulfur 
atom. After polymer extension, the 6-thio-galactosyl moiety sits inside 
BcsA’s catalytic pocket; then upon UDP/Mg?* binding and gating loop 
insertion into the active site, the density of the newly-added sugar 
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Figure 2 | Movement of BcsA’s finger helix is essential for cellulose 
translocation. In vitro cellulose formation by wild-type and mutant BcsA- 
BcsB complexes. a, Activity in IMVs under reducing (DTT) and oxidizing 
(sodium tetrathionate) conditions. For each mutant tested, the apparent 
activity was normalized to its activity in the presence of 50 mM DTT and 
(1) and (2) indicate the order of DTT and tetrathionate addition. b, DTT 
and tetrathionate titrations are shown in red and blue, respectively, for 
wild-type BcsA (dashed lines) and BcsA-2C (solid lines). All experiments 
were performed in triplicate and error bars represent the deviations from 
the means. Right, location of the engineered disulfide bond in BcsA-2C, 


disappears, and the thio-galactosyl unit moves into the transmembrane 
channel next to Trp383 (Extended Data Fig. 2), thereby confirming the 
genuine translocation of cellulose in crystallo. 

The ability of UDP/Mg?* to induce translocation brings about the 
question of why translocation doesn't occur immediately after glycosyl 
transfer when the UDP/Mg?* product is bound at the active site and 
the gating loop is inserted. All states of BcsA observed thus far either 
show its gating loop inserted into the active site and the finger helix in 
the ‘up’ position (Fig. 1b), or the gating loop retracted from the active 
site and the finger helix in the ‘dowr position, if the cellulose polymer is 
extended!” (Fig. 1a). This suggests that the finger helix cannot move 
to the ‘down’ position unless the gating loop is retracted from the active 
site and that in turn gating loop insertion could induce the upward 
movement of the finger helix. This coupled movement is probably due 
to steric clashes between the side chain of Ile340, preceding the TED 
motif of the finger helix, and the gating loop’s backbone (Fig. 1c). e340 
is primarily conserved among bacterial cellulose synthases, eukaryotic 
enzymes usually contain a valine or occasionally a leucine residue at 
this position, which could perform a similar function. Indeed, BcsA 
carrying a Val at position 340 shows indistinguishable catalytic activity 
compared to the wild-type enzyme, while an Ile to Leu substitution 
reduces the apparent activity by about 50% (Fig. 1d). Thr, Ala, Ser, Phe 
or Trp, however, support only low or background activities, suggesting 
that gating loop to finger helix coupling requires a fairly rigid, hydro- 
phobic residue at position 340 (Fig. 1d). 


Finger helix must move for cellulose translocation 

The conformational changes of BcsA described above suggest that its 
finger helix moves up and down during cellulose translocation. To test 
this hypothesis, we engineered double-Cys BcsA mutants expected 
to crosslink the finger helix to an amphipathic helix (IF2) above the 
GT domain!” (Fig. 1a and Extended Data Fig. 3), and screened those 
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coloured as in Fig. 1. c, Comparison of BcsA-2C finger helix positions 
following polymer extension under oxidizing and reducing conditions. 
Upon cellulose elongation, crystals were oxidized or reduced and 
incubated without substrate for 16 h or 6 h, respectively. d, Comparison of 
in crystallo cellulose translocation in wild type and BcsA-2C. The cellulose 
polymer was extended, then translocation was initiated as described in 
Fig. 1, and crystals were harvested after the indicated incubation periods. 
In all panels, the unbiased electron densities for the glucan and finger helix 
are contoured and coloured as described in Fig. 1. Trp383 of the acceptor- 
binding site is shown in grey. 


mutants for changes in catalytic activity upon oxidation. Introducing 
Cys residues at positions 338 near the N terminus of the finger helix 
and 394 at the C-terminal end of IF2 (hereafter referred to as BcsA-2C) 
results in attenuated catalytic activity compared to the wild-type 
enzyme or single-Cys mutants in inverted membrane vesicles (IMVs). 
Full activity, however, can be restored upon addition of excess DIT 
(Fig. 2a). Upon purification of BcsA-2C and reconstitution into pro- 
teoliposomes, its catalytic activity further decreases to ~20% under 
non-reducing conditions (Fig. 2b), probably owing to complete 
disulfide bond formation during purification. Addition of the oxidizing 
reagent tetrathionate does not further decrease the enzyme’s apparent 
activity, yet the catalytic activity robustly recovers with increasing DTT 
concentrations. The crystal structure of this BcsA-2C mutant reveals 
that the disulfide bond forms when the finger helix is in the ‘up’ posi- 
tion (Fig. 2b and Extended Data Fig. 3), similar to its position in the 
post-translocation state”. 

Additionally, the ability to observe cellulose elongation and trans- 
location in crystallo allows us to further delineate how the engineered 
disulfide bond affects BcsA’s activity. Soaking BcsA-2C crystals with 
substrate leads to polymer extension as observed for wild-type BcsA, 
demonstrating that glycosyl transfer is not abolished by the mutations 
(Fig. 2c, d). Subsequently, if those crystals are then incubated overnight 
under oxidizing conditions, the finger helix remains in the ‘up’ position, 
revealing that the cross-link prevents the finger helix from resetting to 
the ‘down position (Fig. 2c). Binding of UDP/Mg”* in these crystals, 
which induces polymer translocation in wild-type BcsA within 15 to 
45 min, fails to initiate translocation, even after an overnight incuba- 
tion (Fig. 2d). However, reducing the engineered disulfide bond in the 
BcsA-2C complex restores the capability of the finger helix to move 
downwards following polymer extension and, most notably, restores 
polymer translocation (Fig. 2c, d), thereby directly correlating the move- 
ment of the finger helix with BcsA’s ability to translocate the polymer. 
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Figure 3 | The product-bound state. The product-bound state of BcsA 
contains an elongated cellulose polymer and an inserted gating loop 
coordinating UDP/Mg?" at the active site. a, Cellulose was elongated 

in crystallo with a 2-deoxy-2-fluoro-glucose moiety and UDP/Mg** 

was rebound to the active site as described in Fig. 1. Unbiased Sigma-A 
weighted F, — F, difference electron densities contoured at 4 and 4.5a are 
shown for the nascent cellulose polymer and UDP/Mg?" in green and 
blue, respectively. b, The terminal glucose unit of the extended cellulose 
polymer forms interactions with the 8-phosphate of UDP and Asp246 
of the DxD motif. Colours are as in Fig. 1 and fluorine is shown in 

light blue. 


The product-bound state 

Directly after glycosyl transfer, BcsA contains an elongated glucan 
plus UDP/Mg?" and an inserted gating loop at the active site, as well 
as the finger helix in the ‘up’ position. This ‘product-bound’ state is 
accessible through the BcsA-2C mutant described above. Because the 
engineered disulfide bond tethers the finger helix, UDP/Mg** can 
be bound to the active site after polymer extension without inducing 
translocation. 

Alternatively, we observed that incorporating 2-fluoro-substi- 
tuted glucose into the polymer stabilizes a similar product-bound 
state. Attempting to trap a ‘donor-bound’ state by using the usually 
(but not always) unreactive UDP-2-fluoro-glucose as substrate?”?>, 
we observed that the nascent glucan is elongated. However, the 
subsequent translocation of this polymer is significantly impeded 
(but not abolished), which may be due to the loss of a hydrogen bond 
between Thr341 of the finger helix and the polymer’s terminal C2 
substituent, Fig. la. Thus, elongating the cellulose polymer with 
2-fluoro-glucose in wild-type BcsA, then diluting the substrate and 
soaking in UDP/Mg?* reproduces a similar product-bound state to 
that obtained for the BcsA-2C mutant. Due to higher quality diffrac- 
tion data (Extended Data Table 1), we discuss the structure obtained 
with 2-fluoro-glucose. 

The product-bound BcsA structure shows that the catalytic pocket 
can accommodate both an extended cellulose polymer and UDP/Mg”", 
suggesting that the newly added glucose unit can align with the 
polymer before the gating loop retracts from the active site and UDP 
is released (Fig. 3). In this position, the only major interactions of the 
terminal glucose unit are with UDP’s 8-phosphate as well as Asp246 of 
the ‘DxD’ motif" via its C2 or C6 hydroxyl group (depending on its 
orientation) (Fig. 3b). 

Of note, the individual glucose units in cellulose are rotated by 
approximately 180° relative to their neighbours***. Therefore, during 
relaxation into the polymer’s plane, the newly added glucose moiety 
must rotate either clockwise or counter clockwise to be in register with 
the preceding glucose units”>®, This alternating rotation is most likely 
to be driven by the formation of intramolecular hydrogen bonds*® 
and could be facilitated by UDP release after glycosyl transfer, which 
minimizes steric restrictions at the active site (Fig. 3). 


The donor glucose binds to a pocket under the acceptor 
To stabilize the substrate-bound state of BcsA, we synthesized and 
employed a non-hydrolysable phosphonate substrate analogue”’, 
(UDP-CH)-Glc), in which a methylene bridge connects the donor 
glucose with UDP’s B-phosphate. Additionally, we also capped the cel- 
lulose polymer with galactose (see Methods), which cannot be extended 
owing to an axial instead of an equatorial hydroxyl group at its C4 
position. 
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Figure 4 | The substrate-bound state. The donor glucose binds in a 
conserved pocket beneath the acceptor. a, Unbiased Sigma-A weighted 

F, — F, difference electron density of UDP-CH)-glucose at BcsA’s active site, 
contoured at 40. Right, conserved residues of BcsA involved in coordinating 
the donor glucose are shown as sticks. b, Interactions between the donor 
sugar moiety and BcsA’s Doa6xD, FFC31gGS, T341ED343 and QxxR382W33 
motifs. c, Comparison of substrate- and product-bound states. Aligning the 
substrate’s pyrophosphate group with the position of UDP in the product- 
bound state (Fig. 3) positions the donor’s C1 carbon within approximately 
2.9 A of the acceptor’s hydroxyl group. The substrate is shown as grey and 
black sticks for its UDP and glucose moieties, respectively. 


The donor glucose inserts underneath the acceptor into a conserved 
hydrophilic pocket formed by BcsA’s TED, HAKAG and FFCGS motifs 
(Fig. 4a), and the gating loop cooperates with the QxxRW motif to 
stabilize the substrate. In particular, the gating loop’s Phe503 forms 
cation-T interactions with Arg382 of the QxxRW motif, which in turn 
forms a salt bridge with the nucleotide’s 8-phosphate as well as a hydro- 
gen bond with the donor’s Cé hydroxyl (Fig. 4a, b). Further, the donor’s 
C3 hydroxyl interacts with the backbone carbonyl of Cys318 of the 
FFCGS motif and its ring oxygen is in hydrogen bond distance to the 
Ne of Trp383 of the QxxRW motif (Fig. 4b). Thus, the recognition 
of the donor’s C3 and C6 hydroxyls and its ring oxygen seems to be 
particularly important for substrate selectivity. 

In the substrate-bound state, BcsA’s finger helix is in the ‘up’ con- 
formation and positions Asp343 of the TED motif within 2.5 A of the 
acceptor’s C4 hydroxyl group, consistent with its likely function as 
general base during catalysis (Fig. 4b). However, the distance between 
the acceptor and the donor's C1 carbon is about 4.2 A (assuming 
glucose instead of galactose as the polymer’s terminal sugar), which 
is probably too far for a direct transfer. We note that the pyrophos- 
phate group of UDP-CH)-Glc is less deeply inserted into the active 
site compared to UDP in the product-bound state (Extended Data 
Fig. 4), perhaps owing to the substrate’s methylene-bridge and/or 
the capping of the glucan with galactose. Repositioning UDP-CH)- 
Glc according to the UDP conformation in the product-bound state 
places the donor’s C1 carbon within approximately 2.9 A of the accep- 
tor’s C4 hydroxyl, a suitable distance for glycosyl transfer”® (Fig. 4c). 
This distance is likely to be also maintained when the acceptor is 
in the opposite orientation (as is the case for every other glucose 
unit), owing to repositioning of the terminal glucose unit at the 
active site. 
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Figure 5 | Model of cellulose biosynthesis. a, Cellulose biosynthesis 
might start with substrate binding to BcsA when the polymer’s 

terminal glucose unit sits at the acceptor site at the entrance to BcsA’s 
transmembrane channel. At this time, the gating loop stabilizes the 
substrate and the finger helix in the ‘up’ position (red arrow). b, c, Glycosyl 
transfer generates the product-bound state (b) and retraction of the gating 
loop and UDP release allows the finger helix to reset to the ‘down’ position 
to contact again the polymer’s terminal glucose unit (c). Substrate binding 
to this pre-translocation state and insertion of the gating loop could 
induce the upward movement of the finger helix and polymer translocation 
(a) or spontaneous translocation might precede substrate binding via a 
post-translocation state (d). 


Implications for cellulose biosynthesis 

Cellulose biosynthesis requires that BcsA binds the substrate, positions 
it for and catalyses glycosyl transfer, translocates the extended polymer, 
and exchanges UDP with UDP-Glc for a subsequent elongation cycle. 
Our structural snapshots of a complete cellulose biosynthesis cycle 
suggest that BcsA accomplishes this in three steps (Fig. 5). 

Upon substrate binding, BcsA’s gating loop inserts into the catalytic 
pocket, thereby positioning the donor glucose for transfer and per- 
haps also stabilizing the UDP leaving group (Fig. 5). In this state, the 
acceptor glucose rests next to Trp383 at the entrance to the transmem- 
brane channel and interacts with the TED motif at the N terminus of 
the finger helix. After glycosyl transfer, the newly added glucose unit 
aligns with the polymer and extends into the catalytic pocket next to 
UDP’s pyrophosphate group. In this product-bound state, the gating 
loop remains inserted into the active site and the finger helix contin- 
ues to point ‘up’ as observed in the substrate-bound state. Next, BcsA’s 
gating loop retracts to release UDP and the finger helix resets to the 
‘down position to interact again with the polymer’s terminal glucose 
unit. Binding of a new substrate molecule to this pre-translocation state 
could elicit the translocation of the extended polymer (by an upward 
movement of the finger helix) through re-insertion of the gating 
loop into the active site. In crystallo translocation experiments with a 
galactose-capped polymer and UDP-Glc as substrate confirmed that 
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the polymer can indeed be translocated when UDP-Glc/Mg”* binds to 
the active site (Extended Data Fig. 5). An alternative, perhaps slower, 
pathway could be the translocation of the polymer before substrate 
binding, facilitated either by random cycles of gating loop insertion 
and retraction or favourable interactions of the polymer in the extra- 
cellular milieu. 

How could the opening of the gating loop and resetting of the finger 
helix be coordinated? The retraction of the gating loop after glycosyl 
transfer could be facilitated by the rotation of the newly added sugar 
into the plane of the polymer. Alternatively, it is possible that the Mg”* 
coordination changes after glycosyl transfer, which in turn could affect 
the stability of UDP and the gating loop at the active site, as proposed 
for non-processive galactosyl transferases”. 

BcsA’s finger helix is capped at its N terminus by the TED motif, 
which is invariant among cellulose synthases. Within the motif, Thr341 
and Asp343 form hydrogen bonds with the polymer’s terminal sugar 
unit, which may enable the finger helix to exert force on the polymer 
during the upward movement. Both residues are well-suited for this 
task: as a B-branched amino acid, the side chain hydroxy] of Thr is ster- 
ically restricted, and Asp343 is further rigidified by interactions with 
its backbone amide proton and that of the following residue (Extended 
Data Fig. 6). 

How might the finger helix move downward without retro-translo- 
cating the glucan? N-terminal capping of «-helices with Asp residues 
has been shown to significantly stabilize the helical conformation in a 
pH dependent manner*”?!. During catalysis, Asp343 abstracts a proton 
from the acceptor’s C4 hydroxyl group", thereby probably altering its 
interaction with the amide protons at the N terminus of the finger 
helix. Thus, we speculate that the destabilization of the finger helix 
during glycosyl transfer enables it to refold in the ‘down’ position, after 
UDP release and subsequent deprotonation of Asp343. This notion 
is supported by the position of another conserved residue within the 
finger helix. Thr346 sits at the membrane distal side of the finger helix 
just three residues C-terminal of Asp343, and its side chain hydroxyl 
provides an alternative hydrogen bond partner for the backbone car- 
bonyl of the preceding Glu342 (Extended Data Fig. 6). Threonine res- 
idues in a-helices, in particular in hydrophobic environments, often 
induce helical kinks, which could facilitate the repositioning of the 
finger helix, 

On its own, the finger helix is insufficient for cellulose translocation, 
which requires substrate or UDP binding and gating loop insertion 
into the active site. Thus, we conclude that the free energy of sub- 
strate binding energizes cellulose translocation. Additional thermo- 
dynamic driving force for translocation may be generated by the base 
catalyst itself. The post-translocation state is likely to be energetically 
favourable, owing to a strong interaction between Asp343 and the 
acceptor’s C4 hydroxyl. This interaction is broken upon protonation 
of its side chain during catalysis but re-established after polymer 
translocation. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Incrystallo cellulose synthesis. Rhodobacter sphaeroides BcsA-BcsB was purified 
and crystallized as previously described” by the bicelle crystallization method 
with the exception that gel filtration and crystallization were carried out in buffers 
lacking MgCh. After the crystals grew to full size (about 2 weeks), cryo-protection 
was initiated by three successive 211 additions of cryo solution (well solution con- 
taining 20% glycerol) to the crystal mother liquor without added MgCh, each 
addition separated by 10 min. 

After the third addition of cryo solution, the polymer was elongated by adding 

0.8 ul of 100 mM UDP-activated sugar (glucose or 2-fluoro glucose) in the absence 
of MgCl, to the ~8 jl crystallization drop for a final UDP-sugar concentration 
near 10mM. The crystals were incubated with UDP-sugar for 2-3 h at 30°C. After 
incubation, 6 11 of the crystallization solution was replaced with an equal volume 
of fresh cryo solution and this process was repeated twice to dilute the UDP-sugar 
concentration approximately 65-fold. Subsequently, crystals were then looped and 
flash-cooled in liquid N> at various time points. 
In crystallo cellulose translocation. For in crystallo translocation experiments 
using wild-type BcsA- BcsB, 0.811 of a solution containing 100 mM UDP and 
250mM MgCl was added to the ~8 1] crystallization drop (see above) for a final 
concentration of ~10mM UDP and 25mM MgCl). Crystals were looped and 
flash-cooled in liquid N> at various time points. 

For translocation experiments using the BcsA-2C mutant, cellulose was 
extended as described above and after completion of cryo-protection, sodium 
tetrathionate was added to a final concentration of ~1 mM or dithiobutylamine 
was added to a final concentration of ~100 mM, followed by incubation for 30 
or 15 min, respectively. Then, UDP/MgCl, were added as described above, and 
crystals were harvested at various time points. 

BcsA accepts UDP-Glc as well as UDP-Gal as substrates; however, because 
galactose is the C4 epimer of glucose, elongation of the cellulose polymer with 
galactose is expected to stall after a single turn over. Thus, for translocation experi- 
ments using UDP-6-thio-Gal as substrate, polymer extension was performed as 
described above for other substrates with the exception that the cryo solution 
contained 25 mM MgCl, and the 100 mM UDP-6-thio-Gal solution contained 
84mM dithiothreitol (DTT). 

Data collection. Diffraction data for wild-type BcsA-BcsB and its double cysteine 
mutant were collected and processed as previously described”°. Diffraction data 
for UDP-6-thio-Gal were collected at 6.5 keV at NE-CAT to high redundancy. 
Phases were obtained by molecular replacement using a search model composed 
of pdb 4P02 with all ligands as well as residues 332-350 (finger helix) and 499-510 
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(gating loop) of BcsA omitted. All ligands except for the final 4 glucose units of 
the cellulose polymer were subsequently added, and the models were refined in 
Phenix_refine**. Ramachandran analyses of the product-bound, substrate-bound 
and pre-translocation state structures identify 95.8/3.9/0.3%, 97.6/2.4/0.0% and 
96.8/3.2/0.0% residues in the preferred/allowed/outlier regions, respectively. 
Figures were prepared using PyMol* and crystallographic software is supported 
by SBGrid**. 

UDP-CH)-Glc soak to generate the donor-bound state. BcsA-BcsB was crystal- 
lized in the presence of 1 mM UDP-Gal, which was added to the protein/bicelle 
solution before mixing with the crystallization well solution. Fully-grown crystals 
were cryo-protected at 24°C as described above. The crystals were then incubated 
with cryo-solution containing 1 mM UDP-CH)-Glc and 10mM MgCl, for 20 min, 
harvested, and flash-cooled in liquid No. 

Finger helix cross-linking and activity assays. BcsA cysteine mutants were gener- 
ated from the constructs described earlier’® by using the QuikChange mutagenesis 
technique, and the mutant BcsA-BcsB complex was expressed and prepared as 
inverted membrane vesicles (IMVs) or purified and reconstituted into proteoli- 
posomes (PLs) as previously described’. 

PLs were diluted to 125nM (or IMVs to 12% v/v) with 125mM NaCl and 
25mM sodium phosphate, pH 7.2, and incubated with increasing concentrations 
of DTT or sodium tetrathionate each for 15 min at 37 °C, before initiating cellulose 
biosynthesis. 

To initiate cellulose biosynthesis, c-di-GMP, UDP-Glc, UDP-[#H]Glec, and 
MgCl, were added, giving final concentrations of 100nM BcsA-BcsB, 20 mM 
sodium phosphate, pH 7.2, 100 mM NaCl, 20mM MgCh, 5mM UDP-Glec, 0.25 ,1Ci 
UDP-[?H]Glc, and 204M c-di-GMP. Reactions were carried out in 2511 aliquots 
at 24°C for 15 min, terminated by adding 2% SDS, and the tritium-labelled cel- 
lulose product was quantified by scintillation counting as previously described’. 
Reactions with IMVs containing BcsA Cys or 1340 mutants were incubated at 24°C 
for 3h or 30 min, respectively. All experiments were performed at least in triplicate 
as technical replicates and error bars represent the deviations from the means. 
UDP-sugar syntheses. See Supplementary Information for detailed protocols of 
chemical syntheses and product characterizations. 


34. Adams, P. D. et al. PHENIX: a comprehensive Python-based system for 
macromolecular structure solution. Acta Crystallogr. D 66, 213-221 (2010). 

35. PyMol. DeLano Scientific (Sancarlos CA, USA). 

36. Morin, A. et al. Collaboration gets the most out of software. eLife 2, e01456 
(2013). 
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Extended Data Figure 1 | Conformational flexibility of the gating loop and two alternative backbone positions are indicated by a black dashed 
after cellulose extension. Unbiased Sigma-A weighted F, — F. difference line. The position of the gating loop in the inserted state in the presence 
electron density of the gating loop in the pre-translocation state contoured — of a UDP molecule as observed in PDB entry 4P00 is shown as a cartoon 
at 2c. The ordered part of the gating loop is shown as a thick green ribbon _ representation coloured blue. 
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Pre-translocation 


Extended Data Figure 2 | In crystallo translocation of a 6-thio- 
galactose-containing cellulose polymer. The position of the 6-thio- 
galactose group at the polymer’s non-reducing end was determined after 
polymer extension (upper panel) and upon subsequent incubation with 
UDP/Mg?* (lower panel) in an anomalous difference Fourier electron 
density (DANO) map. DANO peaks detected at a wavelength of 1.74A 
are shown as a red mesh contoured at 3.50. Unbiased Sigma-A weighted 
F, — F, difference electron density for the cellulose polymer is shown as 


a green mesh contoured at 40. The cellulose polymer was extended and 
translocated as described in Fig. 1 with the exception that UDP-6-thio- 
galactose was used as substrate and Mg”* was included during the initial 
soaking step. The extended DANO peak around Cys318 in the post- 
translocation state might arise from overlapping peaks originating from 
Cys318 and the thio-Gal unit in an opposite orientation. All Cys and Met 
residues close to BcsA’s active site are shown as sticks. UDP is shown as 
sticks in violet for its carbon atoms. 
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BesB f a Finger Helix 


Extended Data Figure 3 | Position of the disulfide-tethered finger helix. _ respectively. b, The finger helix-tethered BcsA-BcsB complex was refined 


The BcsA-2C-BcsB complex was crystallized as described for wild-type in a resolution range from 34 to 3.2A toa final R/Réree of 19.9/23.9% 
BcsA-BcsB. a, Unbiased Sigma-A weighted F, — F. difference electron in Phenix_refine** with Ala residues at positions 338 and 394 of BcsA. 
density of BcsA’s finger helix contoured at 40 (magenta mesh). Cellulose A strong difference electron density peak indicates the position of the 
and BcsA’s Trp383 at the entrance to the transmembrane channel are omitted disulfide bond in a Sigma-A weighted F, — F. difference electron 
shown as sticks in cyan and grey for their carbon atoms, respectively. The density map, (green mesh, contoured at 20). 


finger helix and IF2 are shown as cartoon helices coloured yellow and grey, 
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Extended Data Figure 4 | Comparison of the UDP conformation in the the carbon atoms and the UDP molecule from PDB entry 4P00 is shown 


substrate and UDP-bound states of BcsA. The substrate-bound BcsA as grey sticks. BcsA’s finger helix is shown as a yellow cartoon and the 
structure was superimposed with PDB entry 4P00 by secondary structure cellulose polymer is shown as cyan and red ‘balls and sticks’ as observed in 
matching in Coot. The substrate is shown as ‘balls and sticks’ in violet for PDB entry 4P00. Magnesium is shown as a green sphere. 
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Extended Data Figure 5 | UDP-Glc induced polymer translocation. presence of UDP/Mg’+ or UDP-Glc/Mg”", respectively. The unbiased 

The nascent cellulose polymer was extended with a chain-terminating SigmaA-weighted F, — F, difference electron density of the nascent 
galactose residue upon soaking BcsA-BcsB crystals with UDP-Gal. polymer (green mesh) is shown at three different contour levels, indicating 
Following dilution of the substrate as described in Fig. 1, crystals were that UDP-Glc also induces polymer translocation. 


incubated for 150 min either in the absence of a nucleotide or in the 
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Extended Data Figure 6 | Stabilization of BcsA’s finger helix by 
conserved residues. Top panel: stick representation of BcsA’s finger 
helix and nascent cellulose polymer shown in yellow and cyan for their 
carbon atoms. The finger helix is shown as a poly-glycine helix except for 
the labelled residues. Bottom panel: The finger helix’s “‘TEDxxT” motif is 
conserved among pro- and eukaryotic cellulose synthases. Finger helix 
sequences are aligned for Micrasterias denticulata CesA, Physcomitrella 


patens CesA5, Arabidopsis thaliana CesA8, Rhodobacter sphaeroides and 
Escherichia coli BcsA, and Ciona savignyi CesA. The conserved threonine 
following the TED motif is indicated with a red box. Of note, the threonine 
residue is absent from the Ciona CesA sequence; however, this protein 
contains a serine residue at the following position, which could perform a 
similar function. 
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Extended Data Table 1 | Crystallographic data collection and refinement statistics 


Data collection 


Space group P2,2,2) P2,2,2) P2,212) 
Cell dimensions 

a, b, c (A) 67.3, 216.8, 221.1 68.0 216.8 220.8 67.4 218.2 220.8 

a, B,y (°) 90, 90, 90 90, 90, 90 90, 90, 90 
Resolution (A) 39.43—2.95 (3.01— 35.05—2.90 (2.96-2.90) 29.48-3.4 (3.52-3.4) 

2.94)* 

Rosien 0.060 (0.512) 0.094 (0.514) 0.056 (0.638) 
CCj)-* 0.995 (0.636) 0.986 (0.577) 0.997 (0.787) 
Mean J / of 10.3 (1.6) 5.2 (1.2) 9.6 (1.3) 
Completeness (%) 98.6 (82.4) 99.3 (90.0) 99.7 (100.0) 
Redundancy 5.0 (4.2) 10.4 (9.1) 6.6 (6.9) 
Refinement 
Resolution (A) 34.35—2.94 34.92-2.95 29.48-3.4 
No. reflections 

Total 68,776 70,259 86,075 

Rirce 3,429 3,329 4,367 
Ryork / Riree (Yo) 20.6/23.4 20.7/24.2 22.78/26.8 
No. atoms 

Protein 10,673 10,725 10,618 

B-1,4 glucan 198 188 199 

c-di-GMP 92 92 92 

UDP 25 : < 
UDP-CH>-Gle - 36 - 

Mg” 2 2 

Lipids 89 90 69 
B-factors 

Chain A 82.8 87.41 155.0 

Chain B 72.2 79.4 147.0 

Chain D 91.3 106.9 211.0 

B-1,4 glucan 80.4 87.7 164.3 

c-di-GMP 68.8 71.4 144.7 

UDP 97.3 
UDP-CH)-Gle 79.4 

Lipids 85.6 122.4 156.0 
R.m.s deviations 

Bond lengths (A) 0.003 0.003 0.004 

Bond angles (°) 0.861 0.905 1.052 


Product-bound 


Substrate-bound 


* Values in parentheses refer to the highest-resolution shell. 
’ Correlation between intensities from random half-data sets. 
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Crystal structures of the M1 and M4 
muscarinic acetylcholine receptors 


David M. Thal'*, Bingfa Sun?*, Dan Feng’, Vindhya Nawaratne!, Katie Leach!, Christian C. Felder*, Mark G. Bures’, 
David A. Evans®, William I. Weis®’, Priti Bachhawat?, Tong Sun Kobilka’, Patrick M. Sexton!, Brian K. Kobilka?® 


& Arthur Christopoulos! 


Muscarinic M1-M5 acetylcholine receptors are G-protein-coupled receptors that regulate many vital functions of the 
central and peripheral nervous systems. In particular, the Ml and M4 receptor subtypes have emerged as attractive 
drug targets for treatments of neurological disorders, such as Alzheimer’s disease and schizophrenia, but the high 
conservation of the acetylcholine-binding pocket has spurred current research into targeting allosteric sites on these 
receptors. Here we report the crystal structures of the M1 and M4 muscarinic receptors bound to the inverse agonist, 
tiotropium. Comparison of these structures with each other, as well as with the previously reported M2 and M3 receptor 
structures, reveals differences in the orthosteric and allosteric binding sites that contribute to a role in drug selectivity 
at this important receptor family. We also report identification of a cluster of residues that form a network linking the 
orthosteric and allosteric sites of the M4 receptor, which provides new insight into how allosteric modulation may be 


transmitted between the two spatially distinct domains. 


The M1-M5 muscarinic acetylcholine receptors constitute an impor- 
tant family of class A G-protein-coupled receptor (GPCRs) activated 
by the neurotransmitter, acetylcholine’. Both the M1 and M4 receptors 
have been associated with learning, memory, and cognition” and have 
emerged as attractive targets for the treatment of various central nerv- 
ous system disorders, including Alzheimer’s disease, schizophrenia, 
and drug addiction**. However, the orthosteric acetylcholine-binding 
site is highly conserved, and the clinical translation of compounds 
targeting these receptor subtypes has remained largely unsuccessful 
owing to adverse side effects from off-target activity at peripheral M2 
and M3 receptor subtypes’ ’. Encouragingly, muscarinic receptors pos- 
sess spatially distinct allosteric binding sites that offer greater potential 
for selective receptor targeting’? '”, and the M1 and M4 receptors are 
prime examples where highly selective positive allosteric modulators 
(PAMs) with central nervous system activity and preclinical efficacy 
have been identified*!*""”, 

So far, however, the structural basis of drug action at these receptor 
types has been largely restricted to mutational analyses'*-?!, with the 
only reported muscarinic receptor crystal structures being of the M2 
and M3 subtypes”*”3. Thus, to better understand the molecular basis 
for orthosteric and allosteric drug interactions with the M1 and M4 
receptors, we sought to obtain high-resolution X-ray crystal struc- 
tures of both subtypes. To gain additional insight into potential mech- 
anisms of allosteric modulation, we complemented our findings with 
active-state homology modelling to rationalize the effects of targeted 
mutations on the interaction between a well-characterized PAM and 
acetylcholine at the M4 receptor. 


Crystallization of the M1 and M4 receptors 

To determine the structures of the M1 and M4 muscarinic receptors, 
we used protein engineering and lipidic cubic phase methodology”™*”’. 
Both receptors were crystallized in the presence of the high-affinity 
and clinically used inverse agonist, tiotropium (Spiriva), to stabilize 


the inactive state. Intracellular loop 3 (ICL3) of the M1 receptor was 
replaced with a T4 lysozyme fusion protein, and in the case of the M4 
receptor a minimal T4 lysozyme (mT4L)*® fusion was used to aid crys- 
tallization (Extended Data Fig. 1). It was also necessary to remove the 
first 21 residues of the amino (N) terminus from the M4 receptor to 
improve diffraction. The M1 receptor was also crystallized with the 
N2Q and N12Q mutations to remove glycosylation sites, and, unin- 
tentionally, an N110Q?*” mutation. Importantly, the binding affinities 
of [7H]QNB (M1 receptor), [7>H]NMS (M4 receptor), acetylcholine, or 
tiotropium were not significantly different at either fusion construct 
compared with the wild-type receptor, suggesting that the alterations 
did not perturb the orthosteric site; the M1 N1 10Q??” mutation also 
had no significant effect on receptor functionality in the absence of 
T4 lysozyme (Supplementary Table 1). The M1 and M4 structures were 
subsequently determined to a resolution of 2.7 A and 2.6 A, respectively 
(Extended Data Table 1). 


Comparison of muscarinic receptor structures 

Overall, the structures of the M1 and M4 receptors are similar to the 
previously solved inactive M2 and M3 receptors”, with similar posi- 
tioning of the seven-transmembrane (TM1-7) bundle and root mean 
squared deviations of 0.6-0.9 A (Fig. 1a). Subtle differences between 
the receptors are observed on the extracellular and intracellular sides 
(Fig. 1b, c) corresponding to regions that are least conserved across 
the muscarinic subtypes (Extended Data Fig. 2). For example, the M2 
receptor differs from the other receptors in the tilt and position of TM1 
and TM7 (Fig. la, b). Notably, the M1 receptor was co-crystallized 
with a Flag peptide co-bound on the intracellular side, which makes 
extensive contacts with TM6 and ICL3 (Extended Data Fig. 3a, b), 
and probably contributes to observed differences in TM5, TM6, and 
a variable linkage between TM7-helix8 (Extended Data Fig. 3c). The 
MI1-N110Q?’ mutation has little effect on the M1 structure other than 
creating a slight bulge in TM4 due to the loss of a hydrogen bond with 
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Figure 1 | Structural comparison of the M1-M4 receptors. a, The 
overall view of the muscarinic structures is shown as cartoons aligned 

to the M3 receptor, with the M1 coloured in green, M2 in yellow (PDB 
accession number 3UON), M3 (PDB accession number 4U15, chain A) 

in orange, and M4 (chain A) in blue. Root mean squared deviations 

for the alignment (excluding T4L fusions) of M1, M2, and M4 versus 

the M3 receptor are 0.86 A, 0.81 A, and 0.62 A, respectively. The ligand, 
tiotropium, for the M4 receptor is shown as sticks and coloured according 
to element: carbon, light blue; oxygen, red; nitrogen, dark blue; sulfur, 
yellow. b, c, Comparison of the views from (b) the extracellular side 


S*+93 (Extended Data Fig. 3d). More interestingly, the M4 receptor was 
crystallized with an intact ionic lock (Extended Data Fig. 3e) a feature 
uncommonly seen in other GPCRs and not present in the other mus- 
carinic structures. It is important to note that the observed differences 
in the intra- and extracellular sides of the receptor occur in regions that 
are solvent accessible or are involved in crystal packing interactions, 
which could contribute to the observed perturbations between sub- 
types; however, none of the crystal packing interactions grossly affect 
the structure or the core of the receptor. 

Like the inactive M3 receptor, the M1 and Mé4 receptors were crys- 
tallized in complex with the inverse agonist, tiotropium, and this 
binding site is buried deep within the transmembrane core (Fig. 1d). 
The binding pose of tiotropium and surrounding residues between these 
three structures is nearly identical (Fig. 1d-f), which is not surprising 
given the near absolute conservation of residues lining the orthosteric 
site in the muscarinic family (Extended Data Fig. 2). However, this 
high degree of sequence conservation does not preclude the possibility 
of differences in tertiary structure with respect to the orthosteric site. 
Indeed, one surprising difference is a change in the rotamer of D112” 
of the M4 receptor (Fig. 1 f, g); a residue that is conserved throughout 
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and (c) the intracellular side. d, M1 and M4 residues involved in 
tiotropium binding are shown as sticks (several residues are omitted 

for clarity). The black dashed line indicates a bidentate hydrogen bond 
between N° and tiotropium. e, Superposition of tiotropium from the 
M1, M3, and M4 structures and QNB from the M2 structure. The arrow 
indicates the main structural difference between tiotropium and QNB. 

f, Comparison of the orthosteric binding site of the M1-M4 receptors 
with orthosteric site residues shown as sticks. g, The rotameric change of 
D112? is stabilized by a network of hydrogen bonds. 


the biogenic amine GPCRs and serves as the counter ion for positively 
charged neurotransmitters”’. This rotameric change points D112°*7 
away from tiotropium and is accompanied by slight movements of 
Y¥4397*° and Y44373, allowing them to form a network of hydrogen 
bond interactions between D112?” and 857°”, W10878, Y43979, 
and Y443’?, which is distinct from the M1, M2, and M3 muscarinic 
receptor structures (Fig. 1g). 

Further comparison of the M1, M3, and M4 tiotropium-bound 
structures with the M2 receptor, which was crystallized with the structur- 
ally similar inverse agonist, QNB, also revealed considerable differences 
around residues D***, Y’*°, and Y’. These three residues surround 
the amine group, which is slightly more bulky for QNB than tiotropium 
(Fig. le-g). Indeed, previous mutagenesis studies” on the M1 receptor 
revealed ligand-specific changes in binding affinities of NMS and QNB 
upon mutation of Y’*? and Y’* to alanine. For the ligand NMS, which 
has a structurally similar tropane ring to tiotropium, a 25- and 48-fold 
loss of binding affinity was observed for the Y7? and Y’*? mutations, 
respectively, whereas little effect was observed for QNB. This suggests 
a potential role for these two residues in stabilizing different inactive- 
state conformations with QNB potentially making compensatory 
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Figure 2 | Comparison of orthosteric and allosteric binding site residues 
across the M1-M4 receptors. a, b, The M1-M4 receptors are aligned to the 

M3 muscarinic receptor (PDB accession number 4U15, chain A) and are shown as 
grey coloured cartoons with views from the (a) membrane and (b) extracellular 
side. Residues in ECL2 are numbered relative to the position of the disulfide- 
bonded cysteine. Carbon atoms are coloured by site with the orthosteric residues 


interactions unavailable to NMS. The fact that the orthosteric site of 
the M4 receptor is in some ways closer to the M1 than the M2 subtypes 
may also allow some rationalization of the relative subtype selectivity 
for canonical orthosteric antagonists such as pirenzepine, which has 
long been known to have a rank order potency of M1 > M4 > M3 > M2 
(ref. 29). We performed induced fit docking (IFD) experiments of the 
antagonist into the inactive-state structures of the M1-M4 receptors. 
The overall poses for pirenzepine were very similar, with slight variabil- 
ity in the positioning of the methylpiperazine moiety (Extended Data 
Fig. 4a). However, there were still distinct differences in the orientation 
of residues D*? and Y’*? between the M1, M3, and M4 subtypes versus 
the M2 receptor, with D?? oriented towards and Y”*? away from the 
methylpiperazine group in the M2 IFD (Extended Data Fig. 4b). These 
differences should be interpreted with caution, as it is possible they 
reflect a restricted sampling in the IFD protocol, and may not be reflec- 
tive of a genuine M2 specific preference. Nevertheless, these results sug- 
gest that the differences in positions of D**? and Y’°?, which surround 
the methylpiperazine group, could contribute to the marked difference 
in potency for pirenzepine between the M1 and M2 subtypes”. 


Allosteric binding and cooperativity 
A comparison of all four solved muscarinic receptor structures illus- 
trates the strikingly high degree of conservation of the residues con- 
stituting the orthosteric site (Fig. 2, green), thus providing a structural 
basis for the difficulty in achieving subtype selectivity when targeting 
this region. In contrast, muscarinic receptors possess a large extra- 
cellular vestibule that contains residues contributing to an allosteric 
site. As shown in Fig. 2 (blue), comparison of these residues reveals a 
striking divergence between subtypes, owing to differences in amino- 
acid composition (Extended Data Fig. 2) and likely additional 
tertiary structure changes that arise as a consequence of the dynamic 
nature of the extracellular loop regions. Also shown in Fig. 2 (yellow) 
are residues that have been previously suggested to form the ‘roof’ of 
the orthosteric site and ‘floor’ of the allosteric site?°*°. These ‘shared’ 
residues show an intermediate degree of tertiary structure divergence 
between subtypes compared with the orthosteric and allosteric site 
residues, and are conserved among all five subtypes with the exception 
of the M2 receptor where L in ECL2 is replaced by F. 

Comparison of the electrostatic surface potential of each receptor 
(Fig. 3) also reveals distinct differences in both the shape and charge 


in green, dark blue for allosteric conserved residues, light blue for allosteric 
non-conserved residues, and yellow for residues that contribute to both sites. 
Oxygen atoms are coloured red, nitrogen blue, and sulfur yellow. Non-conserved 
residues are labelled according to either the most common residue or by the 
residue at the M4 receptor. Residues K*® (M1), D° (M1), and S*® (M3) are 
shown as alanine owing to a lack of electron density on the side chains. 


distribution of the allosteric site and can explain why some of the 
best-studied muscarinic receptor allosteric modulators are cationic 
compounds*. For example gallamine*’, a prototypical negative 
allosteric modulator of muscarinic receptors, has a binding potency 
order of M2 > M1, M4 > M3, M5 (ref. 33). The acidic EDGE sequence 
(Fig. 3b) of the M2 receptor has been shown to be important for 
gallamine affinity and cooperativity; indeed, replacement of M1 resi- 
dues LAGQ with the EDGE (Fig. 3b) significantly improved gallamine 
affinity at the M1 receptor®?. 

Interestingly, inspection of our M4 receptor data also revealed that 
the precipitant, polyethylene glycol 300 (PEG 300), is able to occupy 
the allosteric binding site of the inactive-state receptor (Extended Data 
Fig. 5), a finding consistent with the recent structure of the M3-mT4L 
receptor”°. Surrounding the PEG 300 molecule are residues that form 
the allosteric site from the top regions of TM2, TM3, and TMs 5-7 
(Extended Data Fig. 5b). Furthermore, PEG 300 sits immediately above 
the aromatic cage composed of Y1 13°33, 416° 5!, Y4397°°, and Y44373 
(Extended Data Fig. 5a). These residues have been implicated in reg- 
ulating the dissociation of antagonists from the orthosteric binding 
site**, and we confirmed the ability of PEG 300 to act as an allosteric 
modulator in its own right through its ability to retard the dissociation 
of H]NMS in a concentration-dependent manner with a calculated 
apparent affinity of approximately 10 mM for the [>H]NMS-occupied 
M4 receptor (Extended Data Fig. 5c, d). 

Our finding above illustrates an inherent difficulty in obtaining inactive- 
state structures with prototypical negative allosteric modulators bound 
in the open muscarinic extracellular vestibule, as PEG 300 is a required 
precipitant and is present at concentrations of over 1.0 M. However, a 
recent breakthrough was the solution of the active-state structure of 
the related M2 muscarinic receptor bound to a high efficacy agonist, 
iperoxo, in the absence or presence of the PAM, LY2119620, which 
preferentially bound in a more tightly closed vestibule that arises in 
the active-state*>. Because the M4 receptor is most closely related 
to the M2 subtype, and M4 receptor PAMs are highly pursued as novel 
therapeutic agents**°, we undertook a combined mutagenesis and 
molecular modelling study to complement our structural work and 
gain additional insights into mechanisms governing positive allosteric 
modulation at this muscarinic receptor subtype. 

We investigated the interaction between the well-characterized PAM, 
LY2033298 (refs 13-15, 20, 21), and the cognate agonist, acetylcholine. 
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Figure 3 | Electrostatic and surface properties of the different 
muscarinic receptor structures. a, Electrostatic potentials (+5kT/e 

in blue and -5kT/e per electron in red) mapped on the surfaces of the 
M1-M4 receptor structures calculated at pH 7.0 using the programs 
PDB2PQR**“ and APBS“. b, Residues in ECL2 that make up the EDGE 
sequence at the M2 receptor and the corresponding regions at the other 
subtypes are shown as sticks. Negatively charged residues in the sequence 
alignment are coloured red. 


On the basis of the recent structure of the active M2 receptor bound 
to the LY2033298 congener, LY2119620 (refs 35, 37, 38), it is likely 
that such PAMs bind to an essentially pre-formed closed state of the 
extracellular vestibule. As such, residues whose mutation might alter 
the cooperativity between acetylcholine and LY2033298 fall into three 
general categories: (1) those that make tighter contacts with the ligands 
in the closed state than the open state; (2) those that are immobilized 
by the binding of either ligand, such that the entropic cost is paid by the 
first binding event; (3) non-ligand-contact residues that alter the free 
energy of activation of the receptor and thus the open to closed transi- 
tion. We chose to focus on residues within and between the extracellular 
vestibule and orthosteric sites, which are likely to reflect the first two 
categories; mutagenesis of non-contact residues that govern the free 
energy of receptor transitions are beyond the scope of the current work. 

Because prior mutagenesis studies suggested a role for aromatic res- 
idues in receptor interaction with LY2033298, we generated alanine 
mutations of selected aromatic residues near the top of the receptor and 
applied an allosteric ternary complex model to the data (Methods) to 
determine the effect of each mutation on the affinity of acetylcholine 
(Ka) or LY2033298 (Kg) for the free receptor and the magnitude of 
positive cooperativity (a) between the two ligands. We also chose to 
investigate selected (non-aromatic) residues that line the proximal and 
distal ends of ECL2, given the important role this region plays in the 
binding of modulators to the extracellular vestibule!**°-"". The results 
of these experiments are summarized in Supplementary Tables 2-4 and 
include prior mutagenesis results from our laboratory for the same set 
of ligands. To rationalize our findings, we used the recent active state 
M2 receptor structure as a template to generate a homology model of 
the M4 receptor bound to acetylcholine and LY2033298, and compared 
this with our inactive state crystal structure (Fig. 4 and Extended Data 
Fig. 6). 

The most dramatic effect on the affinity of the PAM was noted 
upon mutation of W4357°° at the top of TM7, with a complete loss 
in LY2033298 binding, similar to our previous observations”! upon 
alanine substitution of F186"C'? (Fig. 5, Extended Data Fig. 7 and 
Supplementary Tables 3 and 4). Alanine mutations of residues Y113°3, 
Y416°°!, and Y4397°, which form the roof of the orthosteric site, led 
to significant decreases in cooperativity. A slight increase in modula- 
tor affinity and significant decrease in cooperativity was also noted 
with mutation of Y89°!, together with our prior identification of res- 
idues W108°?8 and L109? as likely contributors to the PAM binding 
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Figure 4 | Model of an active M4 receptor bound to acetylcholine and 
LY2033298. a, Comparison of the M4etiotropium receptor structure 
(blue) versus an active-state model (orange) bound to acetylcholine and 
LY2033298 on the basis of the active M2eiperoxoeLY2119620 structure 
(PDB accession number 4MQT) as viewed from the membrane. The active 
M4 model was aligned to the M4etiotropium structure (chain A, excluding 
TAL) with a root mean squared deviation of 0.9 A. b, c, Cross-sectional 
views of the (b) allosteric site and (c) orthosteric site as viewed from the 
extracellular side with a 90° rotation relative to the membrane from a. 
Residues surrounding each site are shown as sticks (several removed for 
clarity). Acetylcholine and LY2033298 are shown as (a) spheres or (b, c) 
sticks and coloured according to element: carbon, cyan; oxygen, red; 
nitrogen, blue; sulfur, yellow; chlorine, green. Acetylcholine is in the trans 
conformation and aligns in a similar pose to iperoxo (c, transparent yellow 
sticks). 


pocket?®. Comparison of our inactive state structure to the active state 
model now provides a mechanistic rationale for our findings, specif- 
ically a contraction of the extracellular vestibule that results predom- 
inantly in an inward movement of N423658, F1862CL2 and W43573> 
allowing 1-stacking interactions to occur with the modulator in the 
active state (Fig. 4b). For the acetylcholine-binding pocket, there is 
a contraction of the pocket mediated by an inward movement of the 
top of TM6 to accommodate the large difference in size between ace- 
tylcholine and tiotropium resulting in significant movement of resi- 
dues Y416°°!, N417°>, W413°"%, and Y4397*? (Fig. 4c). Additionally, 
D112? is reoriented to interact with the choline head-group of acetyl- 
choline, and is no longer stabilized by the same hydrogen bond network 
that is seen in the inactive state (Fig. 1g). 

Importantly, mapping of the amino-acid residues that significantly 
affect the cooperativity between acetylcholine and LY2033298 upon 
mutation also identified, for the first time, a network that appears to 
link the allosteric and orthosteric sites, involving the interface between 
TMs 2, 3, 6, and 7, and extending along the top of ECL2 (Fig. 5; orange 
coloured residues); this network is consistent with views of allosteric 
modulation that propose a preferred energetic link between orthosteric 
and allosteric sites** but, to our knowledge, has never been directly 
mapped before in a GPCR. Interestingly, a comparison of the side-chain 
locations between the inactive M4 structure and active M4 model for 
residues in the allosteric network reveals that the majority of residues at 
the TM2/3/7 interface that contribute to cooperativity are not predicted 
to undergo appreciable movement between states, whereas compari- 
son of residues further away from the interface (F186FC2, Y416%>!, 
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Figure 5 | A cooperativity network at the M4 receptor. a, Changes in 
either LY2033298 binding affinity (ApKg, coloured black) or cooperativity 
(Aloga, coloured orange) relative to wild-type M4 are shown for 

each mutation. Data represent the mean + s.e.m. from at least three 
experiments performed in duplicate. Statistical differences between 
pharmacological parameters at wild-type versus mutant M4 receptors 

are indicated by asterisks and were determined by one-way analysis of 
variance with Dunnett’s post hoc test, where P < 0.01 (Supplementary 


N423°*8, and W435’*°) are predicted to move significantly between the 
two states (Extended Data Fig. 8). The TM2/3/7 interface, which forms 
part of the hydrophobic core of the receptor, may act as a hinge mediating 
conformational rearrangements in the extracellular vestibule between 
the inactive (open extracellular vestibule) and active (closed extracellular 
vestibule) states of the receptor. Disruption of this hinge by mutagenesis 
alters the packing interactions within the interface and might change 
the energetic barrier between the open and closed conformations of the 
receptor leading to either an increase or decrease in PAM cooperativity. 
Thus, binding of a PAM to the allosteric site might stabilize the confor- 
mation of the allosteric network residues that are otherwise found in 
amore dynamic state. Presumably, structures of the inactive state and 
active M4 model described here represent the lowest energy conforma- 
tions, as they were obtained using crystallography, or are based on the 
X-ray structures of the active M2 receptor* (Extended Data Fig. 8). 

Another noteworthy feature of LY2033298 is that it is selective 
towards the M4 receptor versus the M1 receptor when tested against 
acetylcholine’. This difference in selectivity could arise either through 
differential binding affinities of LY2033298 or through a difference in 
the cooperativity between LY2033298 and acetylcholine between the 
two subtypes (Extended Data Fig. 9). 


Table 3) or P< 0.05 (for previously determined mutations; Supplementary 
Table 4) were considered statistically significant. Cooperativity and 
binding values for F186"! and W4357*° were not determined owing to a 
lack of LY2033298 binding (see Supplementary Tables 2-4). b, c, Residues 
from a were mapped onto the M4 active-state model and coloured as 
orange sticks with translucent spheres with views from (b) the membrane 
and (c) the extracellular side. LY2033298 and acetylcholine are shown as 
sticks and coloured the same as in Fig. 4. 


Conclusions 

Muscarinic receptors remain important drug targets, and designing 
molecules to selectively target the orthosteric binding site has proved 
challenging, as highlighted by the lack of prominent differences 
between the receptor subtypes. Alongside the previously determined 
M2 and M3 structures, the M1 and M4 structures presented here now 
offer a near complete view of the inactive state of this important sub- 
family of GPCRs. Excitingly, comparison of these structures clearly 
reveals a divergence in residues lining the allosteric site, highlighting 
the importance of this region for designing selective drugs. Moreover, 
our enriched structure—-function analysis of the M4 receptor indicates 
that it is possible to combine crystal structure and mutagenesis data 
to uncover new insights into GPCR allosteric modulation, and our 
results point to the TM2/3/7 interface as a network for further studies 
on the mechanistic basis of allostery at class A GPCRs. Together with 
the recent solution of the inactive M2 and M3 receptors, as well as 
the active and PAM-bound M2 receptor, our study has contributed 
to an emerging picture of mechanisms of allostery at a therapeuti- 
cally important receptor family that may facilitate the design of novel 
agents targeting a variety of CNS disorders while avoiding peripheral 
off-target effects. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized. The investigators were not blinded to allocation during 
experiments and outcome assessment. 

M1 and M4 receptor expression and purification. The human M4 muscarinic 
receptor gene (http://www.cdna.org) was cloned into a modified pFastBacl 
vector to give a receptor containing an N-terminal Flag epitope tag and a car- 
boxy (C)-terminal 8 x histidine tag. Residues 226-389 of ICL3 were removed 
and replaced by a minimal Cys-free T4 lysozyme fusion protein”®, The human 
M1 muscarinic receptor gene was also cloned into the modified pFastBacl vec- 
tor, and residues 219-354 of ICL3 were removed and replaced by a Cys-free T4 
lysozyme fusion protein. Both fusion proteins were expressed using the Bac-to-Bac 
Baculovirus Expression System (Invitrogen) in Sf9 cells. Cells were infected at a 
density of 4.0 x 10° to 5.0 x 10° cells per millilitre, treated with 10,.M atropine, 
and harvested at 60h. Receptor was solubilized and purified in the presence of 
tiotropium as previously described for the M3 (ref. 22) receptor using Ni-NTA 
chromatography, Flag affinity chromatography, and size-exclusion chromatogra- 
phy. The N terminus of the M4 receptor was removed by cleavage with HRV 3C 
protease at a concentration of 2% (w/w) during concentration of the receptor before 
size-exclusion chromatography (~2h at 4°C). After size-exclusion chromatogra- 
phy, purified receptor was concentrated to 85 absorbance units (~50mg ml!) and 
flash frozen in small aliquots using liquid nitrogen. 

Pharmacology of crystallization constructs. Sf9 cells expressing wild-type M4 
or M4-mT4L receptor, as described above, were pelleted and washed with PBS 
three times for 1h each to remove any bound atropine. Cells were resuspended 
in binding buffer (10 mM HEPES pH 7.5, 100 mM NaCl, and 10mM MgCl) and 
flash frozen with liquid nitrogen. Saturation binding assays were performed using 
approximately 20,000 cells per well with 9 different concentrations of [7H]NMS in 
a total volume of 0.5 ml for 3h at 37°C. Competition binding assays with acetyl- 
choline and tiotropium were performed in the presence of a fixed concentration of 
[SH]NMS over 10 different concentrations of ligand for 3h at 37°C. Non-specific 
binding was measured in the presence of 10 \.M atropine, and reactions were har- 
vested by rapid filtration through GF/B filters. Data were analysed using Prism 
6.0d. Similar methods were applied for binding assays using wild type M1 and 
M1-TAL, except that [7H]QNB was used as the radioligand. 

Crystallization. Purified M1-T4Letiotropium and M4-mT4Letiotropium were 
crystallized using lipid cubic phase technology. Each receptor was reconstituted 
by mixing the protein solution into 10:1 (w/w) monoolein:cholesterol (Sigma) in 
1:1.5 parts w/w protein:lipid ratio using the two-syringe method™. For the M1 
receptor, samples of 50 nl (20-40 nl for M4) were spotted onto 96-well glass plates 
and overlaid with 800 nl (600 nl for M4) of precipitant solution for each well using 
a Gryphon LCP (Art Robbins Instruments). Glass plates were then sealed using a 
glass cover film and incubated at 20°C. Initial crystals for the M1 receptor formed 
after 24h in conditions containing 33% PEG 300, 100mM sodium acetate, and 
100 mM Bis-Tris Propane (pH 8.0). For the M4 receptor, initial crystals formed 
after 24h in conditions containing 25-40% PEG 300, 50-100 mM EDTA (pH 8.0), 
and 100mM MES (pH 5.5-6.5). M1 and M4 crystals were harvested using mesh 
grid loops (MiTeGen) and stored in liquid nitrogen before use. 

Data collection, processing, and structure determination. X-ray diffraction data 
were collected at the Advanced Photon Source at Argonne National Laboratories 
at GM/CA beamline 23ID-D. Crystals were located by initial rastering using an 
80 1m by 30}1m beam with fivefold attenuation and 1s exposure. Regions that 
contained strong diffraction were then sub-rastered using a 101m collimated beam 
with fivefold attenuation. Data were then collected with the 10j1m beam using no 
attenuation with 1-2 exposures and 1 degree oscillations. To prevent radiation 
damage, data were collected in wedges of 3-10° before moving onto either a differ- 
ent site on the same crystal or a new crystal. Diffraction data were processed using 
HKL2000 (M1 receptor) or XDS* (M4 receptor) and statistics are summarized 
in Extended Data Table 1. Both structures were solved by molecular replacement 
using Phaser*”. For the M1 receptor, the inactive M3 structure” (PDB accession 
number 4DA)J) was split into its receptor and T4L components and used as corre- 
sponding search models. The refinement was performed using Refmacs (ref. 48) 
with manual building in Coot”. For the M4 receptor, the inactive M2 structure? 
(PDB accession number 3UON) and the inactive M3-mT4L”° (PDB accession 
number 4U15) were used as search models for the receptor and mT4L fusion 
domains, respectively. The resulting model was completed by iterative refinement 
in Phenix® and manual building with Coot. MolProbity*! was used for structure 
validation, and figures were prepared using PyMol. Final refinement statistics 
are reported in Extended Data Table 1. 

Induced fit docking of pirenzepine. The inactive state structures of M1, M2, M3 
(PDB 4U15, chain B), and M4 (chain A) receptors were processed by the protein 
preparation wizard of the Schrodinger 2014-2 suite”, after deleting the lysozyme 
insertion region. Missing side chains were added by Prime and hydrogens refined 
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by minimization with the OPLS2.1 force field. Binding grids were defined using 
the default settings in Glide, centring the grid on the crystallized orthosteric ligand 
in each case. The PEG ligand in the extracellular vestibule of M3 and M4 receptors 
was deleted before grid generation. The ligand, pirenzepine, was treated with lig- 
prep software to generate initial protonated 3D structures. Compound structures 
were docked using the induced fit docking protocol with default settings, which 
involves the use of the OPLS_2005 force field to refine residues around poses 
docked by Glide SP, followed by redocking into the generated receptor conforma- 
tions, also with Glide SP. The poses with the lowest induced fit score were selected. 
This scoring function takes into account an estimate of the protein conformational 
penalty along with a protein-ligand interaction docking score. 

Molecular modelling of active M4 receptor. A homology model of a human 
active-state M4 receptor was constructed using the Prime program implemented in 
Maestro version 2014.1 from Schrodinger. The crystal structure of the M2 receptor 
with an orthosteric and allosteric agonist bound (PDB accession number 4MQT) 
was used as a template to build the M4 model. The M2-M4 sequence alignment 
generated by Prime needed no adjustment owing to the overall significant sequence 
homology between the two isoforms. The initial M4 receptor model was built 
with the allosteric ligand (LY2119620) present in the M2 crystal structure bound 
in the M4 allosteric site and with iperoxo bound in the orthosteric site (as also 
present in the M2 structure). The binding mode of LY2119620 in M4 was used as a 
guide to manually dock LY2033298 into the M4 allosteric binding site. In addition, 
iperoxo from the M4 model was manually modified into acetylcholine (ACh). The 
M4-ACh-LY2033298 complex was then subjected to 500 steps of energy mini- 
mization (MacroModel implemented in Maestro 2014.1 from Schrédinger*’) to 
optimize key interactions in the binding sites. The resulting model of ACh and 
LY2033298 bound to M4 was used in subsequent modelling studies described 
in this paper. 

Molecular modelling of active M1 receptor. The active state of the M1 receptor 
was modelled on the basis of the active state structure of M2 bound to iperoxo 
(PDB accession number 4MQT), using the automated protein structure homol- 
ogy modelling web server Swiss-Model**°°. The nanobody structure was removed 
and the resulting coordinates were used as a template to model the M1 primary 
sequence without intracellular loop 3 residues (residues 213-240). The model was 
built using Promod-II, minimized by steepest descent energy minimization using 
a GROMOS96 force field and the quality was assessed by the QMEAN scoring 
function. ACh and LY2033298 were docked in the M1 homology model using 
Swiss-Dock®, using steric and chemical considerations such as shape, charge com- 
plimentary, and keeping the protein structure constant. The top-scoring clusters 
were evaluated manually on the basis of chemical and steric considerations to pick 
the favourable pose. Owing to static docking, the top four ACh poses did not affect 
the docking results for LY2033298. For ACh, the selected pose is in the trans con- 
formation similar to the M4¢ACheLY2033298 model. Finally, the structures with 
the ligand were energy minimized using Chimera with standard Steepest Descent 
and Conjugate Gradient steps. 

Receptor mutagenesis and generation of cell lines. DNA encoding the human 
M4 mAChR with a triple HA”? or cmyc’! tag at its N terminus was subjected 
to QuikChange site-directed mutagenesis (Stratagene) to generate M4 mAChR 
sequences with the desired amino-acid substitutions. DNA constructs in pEF5/ 
frt/V5 (Invitrogen) were stably expressed in Flp-In-CHO cells (Invitrogen), which 
were maintained in high-glucose Dulbecco's modified Eagle’s medium containing 
10% FBS, 16mM HEPES, and 400 1g ml! hygromycin B. Mycoplasma testing was 
performed regularly on cell lines using the MycoAlertTM kit (Lonza); cell lines 
were mycoplasma-free before experiments were conducted. 

Radioligand binding assays. Cell membranes were prepared as described pre- 
viously'*°”. [>H]QNB affinity (Ka) at the M4 WT receptor and mutants was 
determined by saturation binding assays, performed by incubating varying con- 
centrations of [7>H]QNB with 10-100 1g of membranes at 37°C for Ih, ina final 
volume of 0.5-1 ml binding buffer (20 mM HEPES, 100mM NaCl, and 10mM 
MgCl at pH 7.4). 

Radioligand inhibition binding assays were performed by co-incubating 
10-100 1g of membranes with a Ka concentration of [*H]QNB (determined in 
saturation assays, Supplementary Table 2) and varying concentrations of the 
non-radiolabelled test compound in 0.5-1 ml binding buffer in the presence of 
the guanine nucleotide, GppNHp (100 1M), which was used to promote recep- 
tor/G-protein uncoupling. These experiments determined the concentration of 
ACh that inhibited 20% [7H]QNB binding, defined as the 20% inhibitory con- 
centration (IC29), which was used in subsequent interaction studies between [*H] 
QNB, ACh, and LY2033298. These experiments were performed by co-incubating 
10-100 1g of membranes, an IC9 concentration of ACh, and a Ka concentration 
of [>H]QNB with increasing concentrations of LY2033298 in binding buffer con- 
taining GppNHp (100 1M). The reaction was left to reach equilibrium for 3h at 
37°C. For all experiments, non-specific binding was defined in the presence of 
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101M atropine, total binding was determined in the absence of the test ligand, and 
vehicle effects were determined with 0.1% dimethylsulfoxide (DMSO). The assays 
were terminated by vacuum filtration through GF-B glass fibre filters, which were 
washed three times with ice-cold 0.9% NaCl. [7H]QNB radioactivity was measured 
using a Packard 1600 TR liquid scintillation beta counter. Owing to a lack [7H]QNB 
binding, affinity data for W164A**’ were determined from functional pERK1/2 
experiments performed as previously described?°”!. 

Data analysis. Data were analysed using Prism (GraphPad). For radioligand sat- 
uration binding, non-specific and total binding data were analysed as described 
previously**. Inhibition binding curves between [*H]QNB and ACh were fitted to 
a one-site binding model**. Interaction experiments between [*H]QNB, ACh, and 
LY2033298 were fitted to the following allosteric ternary complex model?”?)”: 


ss Bons [Al 
Kakp ) [B] a(T)[B] 
[A] a (ae). Kr Kp Kikg ) 


where Y is the specific radioligand binding, Bmax is the total number of recep- 
tors, [A], [B], and [I] are the concentrations of radioligand, allosteric modulator, 
and unlabelled orthosteric ligand, respectively, Ka, Kz, and Kj are the equilib- 
rium dissociation constants of the radioligand, allosteric modulator, and unla- 
belled orthosteric ligand, respectively, and a’ and a are the cooperativity factors 
between allosteric modulator and the radioligand or unlabelled orthosteric ligand, 
respectively. The value of a’ was taken as 1 when the binding of [7H]QNB changed 
by less than 10% at 10-> M LY2033298 relative to zero LY2033298, and was fixed as 
such for all analyses. Otherwise, the value of a’ was determined using a global fit 
to the allosteric ternary complex model. Statistical differences between pharma- 
cological parameters at wild-type versus mutant M4 receptors were determined 
by one-way analysis of variance with Dunnett's post hoc test, where P< 0.01 was 
considered statistically significant. 
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Extended Data Figure 1 | Crystallization construct design, purification 
and crystallization. a, b, Crystallization constructs used for the (a) M1 
receptor and (b) M4 receptor. All constructs contain an N-terminal Flag 
epitope (yellow), C-terminal histidine tag (purple), and a T4L lysozyme 
fusion protein (red). For the M4 receptor, initial constructs diffracted 

out to 4A; however, the diffraction data appeared to suffer from a lattice 
translocation disorder and were unsolvable. The final crystallization 
construct contained a shortened N terminus with an HRV 3C cleavage 
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Diffraction: 
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site, shown in dark green, and a minimal T4 lysozyme fusion (mT4L)”*, 
shown in red. c, Snake-plot diagram of the best diffracting M4 mAChR 
construct coloured according to a. Residues coloured blue are single-point 
mutations from this study, and residues coloured orange are previously 
studied mutations”””’. d, Size-exclusion chromatography trace of purified 
monodispersed M4-mT4L bound to tiotropium. e, Crystals of M4-mT4L 
obtained in lipidic cubic phase and observed under circularly polarized 


light. 
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extracellular side 
™1 
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c TM7 
TM5 TM6 
4 ® 
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N TED1 ECL1 - 
Ml receptor 15-VLAPGKGPWQVAFIGITTGLLSLATVTGNLLVLISFKVNIELKTVNNYFLLSLACADLIIGTFSMNLETTHBLMGHWALG 94 
M2 receptor 13-ALTSPYKTFEVVFIVLVAGSLSLVTIIGNILVMVSIKVNRHLOTVNNYFLFSLACADLIIGVFSMNLYTLYBVIGYWPLG 92 
M3_receptor 58-DPLGGHTVWOVVFIAFLTGILALVTIIGNILVIVSFKVNKOLKTVNNYFLLSLACADLIIGVISMNLBTTYDIMNRWALG 137 
M4_receptor 22-SSHNRYETVEMVFIATVTGSLSLVTVVGNILVMLSIKVNROLOTVNNYFLFSLACADLIIGAFSMNLYTVYLIKGYWPLG 101 
M5 receptor 20-QPLERHRLWEVITIAAVTAVVSLITLVGNVLVMISFKVNSQLKTVNNYYLLSLACADLIIGIFSMNLETTHELMGRWALG 99 
oe * ve 22% ks Kakkar KK Ry KaRKKK RE KRAKKAKKKRAKK pRaAKK SK * kK 
Ml_ receptor ee, NLLLISFDRYFSVTRPLSYRAKRTPRRAALMIGLAWLVSEVLWAPALLFWOYLVGERTVB 174 
M2 receptor PVVCD. ASWMNLLIISFDRYFCVTKPLTY PVKRTTKMAGMMIAAAWVLSFILWAPAILFWOFIVGVRTVE 172 
M3_receptor NLACDL SVMNLLVISFDRYFSTTRPLTYRAKRTTKRAGVMIGLAWVISFVLWAPATLFWOYFVGKRTVB 217 
M4_receptor AVVCD. SWMNLLIISFDRYFCVTKPLTYPARRTTKMAGLMIAAAWVLSEVLWAPAILFWOFVVGKRIVB 181 
M5 receptor SLACD. SMMNLLVISFDRYFSITRPLTYRAKRTPKRAGIMIGLAWLISFILWAPAILCWQYLVGKRTVB 179 
+ ae ET Trea aoe GAL RAGE ee a ae ke: para pkAKKR AK RAS LR kak 
—— Ec ICL3 —____—_—__ ey] 
Ml_receptor LPVTVMCTLYWRIYRETENRARELAALQGSETPGKGGGSSSS----FSLVKEKKAA 364 
M2 receptor LPVI IMIVLYWHISRASKSRIKKDKKEPVANQDPVSPSLVQG----PPPSREKKVT 386 
M3_receptor MPVTIMTILYWRIYKETEKRTKELAGLOASGTEAETENFVHP----MSLVKEKKAA 490 
M4 receptor LPVVIMIVLYIHISLASRSRVHKHRPEGPKEKKAKTLAFLKS——--QMAARERKVT 399 
M5 receptor TSvSUMOLEYOR YRETEKRTKDLADLOGSDSVTKAEKRKPA-~--VVLVKERKAA 441 
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Extended Data Figure 2 | Sequence conservation across the muscarinic 
receptor subfamily. a—c, The sequence alignment of the human M1-M5 
receptors (d) was determined on the ConSurf server to calculate amino- 
acid conservation scores®°*, Conservation scores for each residue were 
mapped®™®? onto the M4 structure and coloured as a gradient from 

blue (highly conserved) to red (least conserved) with views from the 

(b) extracellular and (c) intracellular sides. The radius of the cartoon 
increases as the residues at each position become more poorly conserved. 
Tiotropium and PEG 300 from the M4 structure are shown as spheres and 
coloured with carbon in white, oxygen in red, nitrogen in blue, and sulfur 


in yellow. d, Amino-acid sequences of the human M1-M5 receptors were 
aligned using the ClustalW2 server®™. Alpha helical regions are shown as 
blue boxes as determined by the consensus of the M1-M4 structures. The 
most conserved residue in each TM (X.50) is in bold lettering. Regions of 
the N terminus, C terminus, and ICL3 regions are removed for space and 
clarity. Insertion points of the T4 lysozyme fusion proteins between TM5 
and TM6 are underlined with bold lettering. Residues from the orthosteric 
binding-site are highlighted in red and allosteric binding-site residues in 
blue. Residues that contribute to both sites are coloured in yellow. 
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Extended Data Figure 3 | Distinct structural features for the M1 and 
M4 receptors. The receptors shown are aligned and coloured as in Fig. 1. 
a, b, The M1 receptor was crystallized with the Flag peptide (DYKDDDD; 
coloured cyan sticks) co-bound on the cytoplasmic surface. Residues of the 
M1 receptor within 4 A of the Flag peptide are shown as magenta coloured 
sticks with views from the (a) membrane and (b) cytoplasmic side. c, The 
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linkage between TM7 and helix 8 of the M1 receptor undergoes a bend 
starting with a change in rotamer of residue Y”°*, which may be a result 
of perturbations in TM6 due to the Flag peptide. d, The M1-N110Q?” 
mutation causes a slight bulge in TM4 due to the loss of a hydrogen bond 
with S**°. e, Chain B of the M4 receptor has an intact ionic lock with R**° 
forming hydrogen bonds with T®** and E®*°. 
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Extended Data Figure 4 | Induced fit docking of pirenzepine into experiments. b, Comparison of the pirenzepine poses for the M1 and M4 
the M1-M4 structures. The receptors shown are aligned and coloured receptor with residues that contribute to the orthosteric site of the M1-M4 
as in Fig. 1. a, Superposition of the poses of pirenzepine from the IFD receptors (several residues omitted for clarity). 
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Extended Data Figure 5 | PEG 300 occupies the allosteric binding site 
of the inactive M4 receptor. a, The cross section of the solvent accessible 
surface area of the M4 receptor is coloured blue. Tiotropium and PEG 
300 are shown as spheres with respective carbons coloured white and 
peach. The aromatic cage of covering tiotropium is highlighted in orange 
b, View from the extracellular side with residues that contact PEG 300 
shown as spheres. c, Dissociation kinetics of [>H]NMS in the presence 
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of PEG 300. [H]NMS was incubated with M4-mT4L membranes 

at 37°C for 3h, followed by addition of 101M atropine + PEG 300 at 
the indicated concentrations and time points. Representative data from 
three experiments, performed in duplicate, fitted to a one-phase 
exponential decay are shown. d, PEG 300 has an apparent binding 
affinity for the NMS-occupied receptor of approximately 

10 mM (log(IC59) = —1.95 + 0.02). 
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Extended Data Figure 6 | Ligand interaction diagrams for the MOE® for the inactive (M4etiotropium structure) and active states 
M4 receptor. a, b, The molecular interactions between the (M4eacetylcholineeLY 2033298 model). Residues with a bold outline were 
(a) orthosteric and (b) allosteric binding sites are shown by the program selected in this study or others”””! as single-point mutations. 
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Extended Data Figure 7 | Identification of key residues that govern 
LY2033298 affinity and binding cooperativity with ACh at the 

M4 receptor. Competition between a fixed concentration of [7H]QNB 
and increasing concentrations of ACh (black circles), LY2033298 (blue 
triangles), or LY2033298 in the presence of an IC29 concentration of 
ACh (red squares) are shown. The curves drawn through the points 


represent the best global fit of an extended ternary complex model. For 
data sets where the binding of [7H]QNB changed by less than 10% at 
10-°M LY2033298 relative to zero LY2033298, the value of a’ was fixed to 
1 (connecting line shown). Data points represent the mean +s.e.m. of at 
least three experiments performed in triplicate. 
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Extended Data Figure 8 | Comparison of cooperativity network 
residues between the inactive and active-states. a, b, Chemical structures 
of (a) the M4 ligands used in this study and (b) the M2 ligands from 

the active-state crystal structures (PDB accession number 4MQT and 
4MQS). c-f, Mapping of the allosteric network onto the (c, d) inactive M4 


structure (blue residues), M4 active-state model (orange residues) and 
(e, f) the inactive (yellow residues) and active-state M2 structures 
(magenta and green residues) with views from the (c, e) membrane or 
(d, f) extracellular surface. Ligands are coloured according to element: 
carbon, cyan; oxygen, red; nitrogen, blue; sulfur, yellow; chlorine, green. 
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Extended Data Figure 9 | LY2033298 binding to active-state M1 and 
M4 models. Comparison of active-state M1 (green) and M4 (orange) 
models bound to LY2033298 and acetylcholine, with acetylcholine and 
LY2033298 shown as sticks and coloured according to element: carbon, 
white; oxygen, red; nitrogen, blue; sulfur, yellow; chlorine, green. Several 
residues surrounding LY2033298 are shown as sticks and coloured 
according to receptor. M4-N42 
movement between the inactive and active states to form a hydrogen bond 
with the methoxy group of LY2033298. In the M1 receptor this residue 
is a serine (S388°°%) and is unable to form a similar hydrogen bond. 
However, mutation of N423°** to alanine at the M4 receptor results in no 
loss of LY2033298 affinity, but does result in a sixfold loss in cooperativity 
between acetylcholine and LY2033298 (Supplementary Table 3). This is 
suggestive of selectivity being derived through cooperativity as a possible 
mechanism between the M1 and M4 receptors. Additional determinants 
for M1 and M4 selectivity could also arise through differences in residues 
on TMs 2 and 7, which contribute to (I93?°°) or sit proximal to (D4327*? 
and $4367*°) the allosteric network. 


368 is predicted to undergo significant 
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Extended Data Table 1 | Data collection and refinement statistics 


Data collection* M1-T4Lsetiotropium | M4mT4Lstiotropium 
Beamline GM/CA GM/CA 
23-ID-D 23-ID-D 
Number of crystals 8 64 
Space group P2121 21 Piz l 
Cell dimensions 
a, b, c (A) 58.0, 72.2, 175.7 48.5, 172.0, 60.7 
a, B, y (°) 90, 90, 90 90, 94.4, 90 
Resolution (A) 30.00—2.70 (2.80-— 28.99-2.60 (2.69- 
2.70) 2.60) 
Rmerge (%) 15.5 (83.6) 21.3 (94.7) 
</ol> 8.7 (1.9) 82 (1.5) 
CCi2 (%) N.D.? 99.0 (54.5) 
Completeness 97.1 (98.2) 99.7 (97.9) 
Multiplicity 4.3 (4.2) 13.1 (4.9) 
Refinement 
Resolution (A) 30.00—2.70 28.53-2.60 
No. of reflections working / test set 19,223 / 1,011 30,299 / 2,146 
Rwork/R free (Yo) 2281 275 22.7 / 24.0 
No. of atoms (Chain A / Chain B) 
Protein 3439 3058 / 3115 
Ligands 210 176 / 167 
Average B-factors (Chain A / Chain B; A?) 73.6 74.0 / 70.5 
Receptor 62.5 63.9 / 64.6 
T4 lysozyme 64.4 99.5 / 85.3 
Tiotropium 66.5 47.4/47.9 
Allosteric site PEG 300 — 74.9 / 81.6 
Waters 58.1 53.7 / 46.7 
Other ligands 74.8 80.8 / 78.4 
RMS deviation from ideality 
Bond length (A) 0.009 0.002 
Bond angles (°) 1.39 0.61 
Ramachandran statistics? 
Favored regions (%) 96.2 99.2 
Allowed regions (%) 3.6 0.8 
Outliers (%) 0.2 0.0 


*Highest shell statistics in parenthesis. 
TN.D., Not determined, because the structure was solved before CC1,2 values were introduced®©. 
4As calculated by Molprobity. 
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Positron annihilation signatures associated with the 
outburst of the microquasar V404 Cygni 


Thomas Siegert!, Roland Diehl, Jochen Greiner!, Martin G. H. Krause!*, Andrei M. Beloborodov’, Marion Cadolle Bel‘, 
Fabrizia Guglielmetti’, Jerome Rodriguez*, Andrew W. Strong! & Xiaoling Zhang! 


Microquasars!~“* are stellar-mass black holes accreting matter 
from a companion star® and ejecting plasma jets at almost 
the speed of light. They are analogues of quasars that contain 
supermassive black holes of 10° to 10!° solar masses. Accretion in 
microquasars varies on much shorter timescales than in quasars 
and occasionally produces exceptionally bright X-ray flares®. 
How the flares are produced is unclear, as is the mechanism for 
launching the relativistic jets and their composition. An emission 
line near 511 kiloelectronvolts has long been sought in the emission 
spectrum of microquasars as evidence for the expected electron- 
positron plasma. Transient high-energy spectral features have 
been reported in two objects”*, but their positron interpretation? 
remains contentious. Here we report observations of 1-ray 
emission from the microquasar V404 Cygni during a recent 
period of strong flaring activity!°. The emission spectrum around 
511 kiloelectronvolts shows clear signatures of variable positron 
annihilation, which implies a high rate of positron production. 
This supports the earlier conjecture that microquasars may be the 
main sources of the electron-positron plasma responsible for the 
bright diffuse emission of annihilation +-rays in the bulge region 
of our Galaxy'!. Additionally, microquasars could be the origin 
of the observed megaelectronvolt continuum excess in the inner 
Galaxy. 

Flaring activity of V404 Cygni was discovered with the Swift/BAT 
and MAXI X-ray monitors!°, and observations with INTEGRAL” 
started within two days, lasting from 17 to 31 June 2015. This recent 
activity period of V404 Cygni—11 days (15-26 June)—was shorter 
than its previous outbursts! in 1934 and 1989. The source exhibited 
multiple flares within hours, outshining’’ the brightest persistent X-ray 
source in the sky, the Crab nebula, by factors of up to 40. V404 Cygni 
represents a ‘gold standard’ for multi-wavelength observations, as the 
parameters of the binary system are well known. It is composed of a 
9-solar-mass black hole with a companion star of 0.7 solar masses'* in 
a 6.5-day orbit’® and a 67° inclination angle“ to the line of sight, and is 
located at a distance of 2.4 kpc (ref. 16). 

INTEGRAL/SPI spectrometer data were extracted and calibrated 
following the standard procedures, including careful accounting for 
the detector response and background (see Methods section ‘SPI data 
extractiom). For spectral analysis, the data have been summed into 
three epochs of ~3 days during the V404 Cygni flaring period. In each 
of these three epochs (INTEGRAL orbits 1554, 1555 and 1557) we have 
detected a significant excess of emission around 511 keV, consistent 
with positron annihilation. 

At energies below 200 keV, the spectrum is well described by the stand- 
ard model!’ of thermal Comptonization plus reflection, but at higher 
energies a large excess (~18s.d.) appears above its high-energy tail 
(see Fig. 1). To further quantify this excess, we added a model spectrum 
of electron-positron plasma with temperature T (see Methods section 


‘Spectral fitting; Extended Data Fig. 1 and Extended Data Table 1); 
the temperature serves as a measure of the annihilation line width. 
The characteristic curved spectral shape of pair annihilation emission 
describes the excess in all three epochs well. Its inclusion leads to a 
5-s.d. overall improvement of the fit compared to a model describ- 
ing the excess with a power law (see Extended Data Tables 2-4 for 
details, including model alternatives). No shift with respect to the 
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Figure 1 | Spectral evolution of V404 Cygni. a-c, Spectra in the soft y-ray 
band in three different flaring epochs (a—c show the spectra measured 

in INTEGRAL orbits 1554, 1555 and 1557, corresponding to epochs 1, 2 
and 3, respectively). Data (blue; error bars, 1 s.d.) are fitted as the sum of 
the Comptonization continuum (black dashed curve) and annihilation 
radiation from a relativistic hot plasma (red dashed curve). The standard 
thermal Comptonization model (grey dashed curve) fits the data up to 
~200 keV; it declines exponentially at higher energies and falls short of 
the observed flux. Our conservative, modified continuum model follows a 
power law instead. E, energy; Fz, flux at energy E. 
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Figure 2 | Model alternatives for epoch 3. Shown is the high-energy 

part of the spectrum in epoch 3 fitted with different annihilation models. 
Data are shown blue in both panels (error bars, 1 s.d.). In both panels, the 
black dashed curve shows the Comptonization continuum, and the black 
solid line, the total model spectrum. a, Two components of annihilation 
in flight in a thermal plasma: kT ~ 4keV redshifted by ~10% (red dashed 
curve), and kT ~ 500 keV (red dotted curve). b, Positronium three-photon 
annihilation redshifted by ~10% (red dashed curve) and annihilation in 
flight in a hot plasma (red dotted curve). 


laboratory frame of the annihilation line is required to fit the data in 
epochs 1 and 2. A redshift of ~10% is required in epoch 3. The line 
width varies strongly between the three epochs—the temperature 
parameter varies from a few keV to about 200 keV. 

Electron-positron pair production is expected near luminous accret- 
ing black holes when their radiation spectra extend above 
E=m,c?=511keV (refs 18, 19), where m, is the mass of the electron, 
and c is the speed of light. Pairs are produced in collisions between MeV 
~-rays, with an average cross-section a 10~*° cm”. This process is 
efficient because of the small size of the source, radius r~ (3-10) Tgp 
where r,~ 10km is the gravitational radius of the black hole. During the 
V404 Cygni flares, the observed luminosity in photons of energy 

mec’ increases up to L; 10°” ergs” ', which corresponds to a photon 
density n, ~L,/(nr?m.c’), and an optical depth for collisions between 
soft \-rays of T,,,-¥n0-,r. Gamma-rays of higher energies are absorbed 
more efficiently as they can interact with the more numerous X-rays of 
lower energies. The large optical depth 7. 1 tends to suppress the 
spectrum of the central compact source at photon energies E >> mc’, 
consistent with the upper limit on the GeV emission (obtained from 
analysing public Fermi/LAT data, Fey < 10~° photons cm~?s~|; see 
Methods section “FERMI/LAT data analysis’). Thus, a significant frac- 
tion of the luminosity is converted to pair plasma: the plasma is contin- 
ually created and annihilated inside the source, forming a broad 
annihilation line of width equivalent to kT ~ 100 keV (ref. 19). Photon 
collisions outside the source create a pair outflow”’. Pairs annihilating 
outside the source are in Compton equilibrium with radiation at a tem- 
perature of a few tens of keV. Pair outflows have also been invoked to 
explain the shape of the X-ray spectra of accreting black holes, in par- 
ticular their flat slopes and the reduced reflection component”). 

These expectations are confirmed by our observations of a broad 
annihilation feature in epochs 1 and 2. The observed flux corresponds 
to a positron creation rate of N,. ~ 10*s~}, and to an energy generation 
rate in the form of pairs, L., of Li ~Nim,c?~ 10 ergs"!, a few 
per cent of the source luminosity’’, in agreement with models””. 
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Pair plasma annihilating outside the source is then consistent with 
observations in epoch 1 (kT + 30keV), while annihilation inside the 
source corresponds to the broad line observed in epoch 2 (kT ~170keV). 

In epoch 3, there is a hint of a steep decline on the blue side of the line 
that is characteristic of three-photon annihilation of positronium atoms 
(see Methods section ‘Spectral fitting’). Figure 2 shows the data fitted 
by a redshifted, narrow annihilation feature (Fig. 2a), or alternatively 
by a positronium-annihilation spectral shape (Fig. 2b). In addition, 
the broad component extending above 511 keV may be fitted by a very 
hot annihilation line; this more complex model would correspond to 
two annihilation regions, cold and hot (see Extended Data Table 5). 

The smaller width of the line in epoch 3 (kT + 2-3 keV), in combina- 
tion with the detected redshift, poses a challenge to models. The 10% 
redshift is consistent with a gravitational redshift of the source at radius 
r= 10rg. A line emitted in this region is expected to be significantly 
broader, owing to the mildly relativistic bulk motions of the plasma 
and dispersion in gravitational redshift. 

The positronium interpretation also poses problems, because it 
requires” a dense plasma with a low temperature, T < 10°K. The min- 
imum temperature of a source with luminosity L ~ 10°8 ergs! and 
size r= 10rg 100 km is its effective blackbody temperature, To 
[L/(4nr7opp)]/4 = 107 K, where opp is the Stefan-Boltzmann constant. 
The accretion flow at larger distances may contain much cooler gas, 
especially if the central radiation is beamed and shielded by the inner 
accretion disk. Then the pair plasma created in the central source may 
be blown out by radiation along the magnetic field lines towards the 
colder gas, decelerate there and form positronium before annihilating. 
This scenario is, however, inconsistent with the redshift of the line, 
in particular when interpreted as a gravitational redshift. 

The morphology of the material near the black hole may also be 
inferred from soft X-ray observations at high spectral resolution”, sug- 
gesting high ionization out to r2 x 10°km. This is a plausible value for 
the outer edge of the accretion disk. Variability of X-ray line strengths 
and a changing ionization strongly suggest that most of the X-rays are 
not received directly but are observed after reflection/reprocessing 
in the outer region. This is consistent with the flat shape of the hard 
X-ray continuum”? up to 100keV. In this picture, the outer edge of the 
disk is elevated, blocking our direct view towards the inner disk, and the 
X-ray continuum variability could be due to changes in the outer disk, 
rather than to rapid changes in the accretion rate onto the black hole. 

The flux evolution of the continuum and annihilation features 
throughout our observation period is shown in Fig. 3. The two com- 
ponents appear to be correlated. When the X-ray flaring of V404 Cygni 
faded, the annihilation signal also vanished. However, some annihilation 
radiation appeared between the X-ray flares. Our time resolution is 
much longer than the dynamical timescale of the inner accretion disk 
(milliseconds), the jet ejection timescale? (minutes), the light travel 
time between the two binary components (80 s), and the timescale of 
adiabatic expansion of ejected blobs? (minutes, less than 1 h). Thus, 
our measurements probe time-averaged values and do not resolve the 
rise time of the annihilation flux, which may be much shorter than 
our 6-h bins (see Methods section “Timing analysis’). Our search for a 
lag between the X-ray flares and the annihilation radiation was incon- 
clusive, although there is an indication of a lag of several hours (see 
Methods section “Timing analysis’). 

In contrast to hadronic gas, pair plasma is easily accelerated and 
immediately attains an equilibrium bulk speed v + c/2 away from the 
source (governed by the local radiation field anisotropy), forming 
the base of a relativistic outflow from the accretion disk”°. The 67° 
inclination implies a Doppler blueshift of ~10%, comparable to the 
gravitational redshift at r= 10rg. From our data in epochs 1 and 2, we 
can neither confirm nor exclude possible residual shifts of the broad 
line. Annihilation radiation around the source can be strongly affected 
by the magnetic field configuration, which can change the outflow 
speed and direction. The power deposited in the pair plasma alone of 
Ls 10* erg s-! would be sufficient to explain the observed typical 
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Figure 3 | Time history of X-ray continuum and annihilation emission. 
The two components of high-energy emission from V404 Cygni are shown 
as they evolve during the June 2015 flaring period, measured between 

17 June and 30 June. INTEGRAL orbits 1554-1558 and epochs 1-3 are 
shown at the top. The black histogram shows the spectral photon flux in 
the 100-200-keV band (left-hand y axis) and represents the brightness of 
the Comptonization component emission. The red circles show the photon 
flux in the annihilation line averaged over 6h (right-hand y axis); vertical 
error bars, 1 s.d. MJD, modified Julian date. Grey shaded areas mark the 
regions where no data have been taken. 


radio luminosity associated with escaping blobs on larger scales”®. 
A changing magnetic field may be responsible for the evolution of the 
line shape between epochs 1 and 2. 

The outflow of pairs from the central source makes microquasars 
efficient factories, enriching the surrounding medium with positrons. 
This can help to solve the puzzle of annihilation radiation observed 
from the bulge region of our Galaxy'!. In a steady state, the annihi- 
lation rate in the inner Galaxy is (1-2) x 10% positrons s~! and must 
be balanced by a positron supply with the same rate. During the 
10-day flaring period, V404 Cygni produced ~10” positronss~'. In 
more luminous microquasars such as GRS 1915+105, this number is 
expected to be even larger. Thus, the steady-state annihilation rate in 
the inner Galaxy requires only about ten such sources to be active at 
any time. With duty cycles (that is, flaring versus quiescent epochs) 
of the order of 10~, as observed for V404 Cygni, ~10?-10* accreting 
black hole binaries of this type would be required in the inner Galaxy, 
consistent with population synthesis estimates”’. 

The excess emission at a few hundred keV to a few MeV that we 
observed in the flaring epochs of V404 Cygni may also be responsible 
for the excess (about 10keV photonscm~s~! at 1 MeV; ref. 28) found 
in the diffuse +-ray continuum spectrum of our Galaxy’s ridge”®. This 
excess is not accounted for by models of cosmic-ray interactions with 
interstellar gas’s, nor by inverse-Compton emission, and had been 
attributed to an unknown source population. A few active sources at 
an average observed microquasar distance of 5 kpc (ref. 26) would be 
needed to explain the diffuse MeV excess. The implied duty cycle is 
consistent with the above estimate for ~10* sources. Thus, the same 
population of microquasars could also be the origin of the observed 
inner Galaxy’s excess in MeV continuum. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

SPI data extraction. INTEGRALS SPI >\-ray spectrometer**! provides photon 
event measurements, where pulse heights are recorded in a 19-element germanium 
semiconductor detector array. A coded mask imprints a shadowgram of celestial 
sources onto the detector array, as particular regions of the sky are blocked from 
detectors through the opaque tungsten elements of the mask. Re-orientations of the 
telescope by small angles (‘dithering, ~2°) provide an additional coding pattern. 
Coded-mask imaging allows sources within the telescope field of view of ~30° to 
be resolved with a precision of ~2°. 

We accumulate event data for each telescope pointing and detector and bin 
them into spectra, after initial preprocessing with INTEGRAL standard OSA 9 
software. We use single-detector hits only, to ensure a well-known spectral detec- 
tor response. Above 530 keV, we use pulse-shape filtered events to suppress an 
electronics malfunction that contaminates normal event data*’, and we apply their 
appropriate filter efficiency factor of 0.8. Pointings typically lasted one hour during 
the observations of V404 Cygni. 

For adequate spectral precision, we accumulate three-day orbit data to extract 
a spectrum for V404 Cygni. The full set of detector spectra over an orbit is then 
fitted with parameterized models for instrumental background and for all candi- 
date sources in the field of view**. The celestial emission models are folded into 
the data space of spectra using the coded mask shadowing properties, and the 
energy response of the detectors. Thus, intensity scaling factors per energy bin 
are obtained for all candidate sources within the telescope field of view, which 
are V404 Cyg, Cyg X-1, Cyg X-3, Cyg A, 3A1954+319 and EXO2030+375. Our 
model fit of background and sky contributions to data then provides spectra for 
all these sources. 

The background is derived from data of the two preceding INTEGRAL orbits 
(1552, 1553). Being from the pre-flaring periods of V404 Cygni and being pointed 
to regions devoid of sources at high Galactic latitudes, these data are free of any 
high-energy (>300 keV) signals from the sky. The background is modelled as a 
constant detector-intensity ratio (pattern) per energy bin, taken from these inde- 
pendent data. Owing to the strong time variability, the overall amplitudes per 
energy bin (background-intensity scaling factors) have been determined on a 
one-hour timescale. 

The continuum sensitivity (3 s.d.)*! around 400 keV for an exposure of two days 
(corresponding to the dead-time corrected on-time of one orbit) is about 3.0 x 10-° 
photonscm ~*s~'keV_! with only a weak dependence towards higher energies 
(about 1.8 x 10~° photonscm~*s~! keV~! at 1,600 keV). This is sufficient to detect 
sources with intensities of ~100 mCrab above 400 keV. V404 Cygni, for example, 
shows an average intensity of (12.0 3.5) x 10-° photonscm~*s-!keV! around 
500 keV, and about (2.5+ 1.6) x 10~° photons cm~?s~! keV~! around 1,000 keV, 
during its flaring epochs (see Fig. 1), in accordance with sensitivity limits. Thus, at 
higher energies, statistical uncertainties dominate over systematics. 

Some cross-talk among the sources is expected to occur in the data of a 
coded-mask instrument with a large field-of-view. Also, concerns about arte- 
facts from background model inadequacies need special attention. We therefore 
compare the spectra of a set of hypothetical sources fitted additionally in the 
empty regions of the field-of-view, together with the above-mentioned known 
sources, as commonalities or anti-correlations among them would be a hint 
of background or cross-talk issues. For a given data set, the total number of 
measured photons is fixed, so that the inclusion of test sources may result in an 
increased flux value for one source while at the same time reduce the flux for 
another (anti-correlations). But apart from clear signals in the spectra of the real 
sources, all source spectra in the empty field regions are fully consistent with 
background only, and in particular do not show any spectral signatures beyond 
statistical fluctuations. 

Spectral fitting. The spectra for the three flaring epochs were fitted as a sum of two 
components: a Comptonized continuum C(E) and pair annihilation emission P(E) 


F(E) = C(E) + P(E) (1) 


Continuum X-ray spectra of accreting black holes in the hard state are well 
explained by the thermal Comptonization model!”*4; seed low-energy pho- 
tons (for example, thermal radiation from the optically thick accretion disk) 
are re-processed by a hot plasma corona through Compton scattering to higher 
energies, leading to a power-law spectrum. The power law cuts off exponen- 
tially where the photon energy exceeds the mean energy of the scattering elec- 
trons, which is typically ~50-100 keV (refs 17, 34). This model describes the 
spectra of V404 Cygni reasonably well below ~200 keV, between and during 
flaring’ . However, additional emission is required above 200 keV. One can 
model this emission as an additional power-law component due to scattering by 
non-thermal, accelerated particles. We use the following approximation to the 
Comptonization spectrum 


with By = Ao ( = yr exp(#2= Ex ) where Ep = 100 keV is a normalization con- 
vention. . : 

The model of thermal pair annihilation (TPA)'**°, P(E), is calculated as follows. 
Electron and positron energy distributions are described by a Maxwell-Jiittner 
distribution, which is the relativistic Maxwell-Boltzmann distribution 


a (-7] (3) 


f= aap) 0 


Here y= (1 — 3°)~!? is the Lorentz factor, 3= v/c is the dimensionless velocity, 
d= _ is the dimensionless temperature of the pair plasma with Boltzmann's 


mc 
constant k, electron (positron) mass m, the speed of light c, and K3 is the modified 
Bessel function of second kind. The spectral distribution of annihilation photons 
hv 35 


can be calculated in terms of a dimensionless photon energy x = — as 
mc 


dn 
dt 


2 x 
PD aa exp a 


(4) 
J 4:2(5— 1)oaas)exp{ — 5] 


Here, nx are the number densities of the positrons/electrons, s = xe is the photon 
momentum in the centre-of-momentum frame, and @ann(s) is the cross-section for 
the annihilation process. We have used simplified expressions for the temperature 
and energy regions of interest®* which are accurate to 0.04% across the range 
presented in this analysis. 

The TPA model (P(E), equation (4)), shown in Extended Data Fig. 1, does not 
include possible Doppler shifts due to the bulk motion of the pair plasma or the 
gravitational redshift, which may be significant when the emission originates near 
the black hole. 

The fitted parameters in our composite model, f(E), are the normalization of 
continuum at 100 keV, that is, the amplitude Ao, the low-energy power-law index a, 
the cutoff energy Ec, the e-folding energy Es, the high-energy power-law index 
, the extrapolation energy Ex, the amplitude of the annihilation feature nn_c, 
and the temperature of the pair plasma Typa. The parameters Ep and Bo do not 
influence the fit and are only introduced for convenience and clear arrangement. 

From these parameters, we calculate the following derived parameters. Irpa is 
the differential flux integrated over the energy 


+00 
Irpa = J P(E)dE (5) 


The plasma temperature Trp, is multiplied by the Boltzmann constant, k, fora 
conversion into keV units. Fitted and derived parameters for all epochs can be 
found in Extended Data Tables 1, 3 and 5. 

The fit quality obtained for our one-orbit spectra is not satisfactory at first 
glance, with 7 values of 63.7, 69.2 and 280.9 for 38 (37) d.o.f. (degrees of freedom) 
(see Extended Data Table 2). Typically, an observation is dominated by a large 
instrumental background. But below ~200 keV, the number of photons detected 
from V404 Cygni is very high, (about five times stronger than the background), 
thus systematics of the instrument response limit the accuracy of the fit. At ener- 
gies above ~150keV, our composite model obtains a satisfactory fit quality with 
x? values of 13.0 to 14.5 for 14 d.o.f. Overall, we detect a large high-energy excess 
above the conventional Comptonized cut-off description, with a total significance 
of ~18s.d., consistent with high-energy excesses reported in other studies!*?>**, 
Our more conservative analysis uses a phenomenological model of the continuum 
that avoids the high-energy cut-off of the Comptonization model by modifying 
it to a power-law extension towards high energies. When we use this modified 
Comptonization description, and add a model spectrum of electron—positron 
plasma of a temperature T, we find a significance of 5 s.d. altogether (see Extended 
Data Table 4) for the pair annihilation component above the Comptonization 
component. Significances have been estimated by \” goodness-of-fit tests, in 
which extra model components are zeroed in order to evaluate the improvement 
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in x7, compared to the simpler model with fewer parameters. The test statistics 
therefore follow a \?-distribution with 1 d.o.f., in which data are properly scaled by 
their statistical uncertainties which account for Poissonian fluctuations of source 
and background. 

The fit shown in Fig. 2b assumes a different annihilation model, more typical 
when positrons are slowed down by, for example, Coulomb interactions before 
annihilating*”. This has been well measured in our Galaxy!!, and in terrestrial 
laboratory experiments**. When positrons annihilate at thermal energies, the spec- 
tra of positron annihilation show the intermediate formation of a positronium 
atom, consisting of a positron and an electron. This process is very efficient at low 
energies and in particular below a threshold energy of 6.8 eV (equivalent to tem- 
peratures below 7.8 x 10‘ K). Positron annihilation then can occur from a singlet 
state of positronium with two photons at 511 keV (para-positronium), or a triplet 
state with a three photon annihilation continuum spectrum” rising in intensity 
up to a maximum at 511 keV (ortho-positronium). 

We define the fitted parameters of the additional components as follows. Ecentroid 
is the Doppler-shifted peak position of the low-energy annihilation feature, that is, 
pair plasma annihilation in Fig. 2a, and the sharp edge of the ortho-positronium 
shape, O(E), in Fig. 2b. The plasma temperature kT rpg is only given for cases in 
which a thermal pair annihilation model has been fitted to the data. In both cases, 
the additional components are scaled/normalized by a fitted amplitude Op. 

We derive the flux, Fs1;, of the second features by integrating the differential spec- 
tral shape over the energy. For Fig. 2a, see equation (5), for the ortho-positronium 
feature, the flux is given by 


+00 


Foi = f O(E)dE = Oo(n? — 9) (6) 


oo 


Timing analysis. For the timing analysis of the annihilation emission, we only 
fit the amplitudes of our model components, Ao of the continuum, and n,.n_c of 
the annihilation feature (see equations (2) and (4) and Fig. 3), in hourly time bins, 
while holding the spectral shape, that is, all other parameters in both continuum 
and annihilation model, fixed. This assumes a constant thermal pair annihilation 
temperature Typ, during one epoch and sums all emission which is spectroscop- 
ically identified as due to positron annihilation. The rise time of the excess flux is 
estimated as a factor of two increase in the positron annihilation component flux. 
This is difficult to estimate, as it is limited by photon statistics; we estimate a flux 
doubling within ~2h. This interprets all fitted signatures on top of the continuum, 
C(E), as pair plasma emission, rather than accelerated particle emission represented 
by a power law. In particular, above 500 keV, this may incur a bias towards high 
annihilation flux values, compared to the detailed spectroscopic analysis per epoch. 

The time evolution of the hard X-ray continuum emission (100-200 keV) 
is obtained’ from fitting in one-hour intervals the coded-mask response to all 
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sources in the field together with the background model, adjusting the background 
normalization coefficient per hour. 

The linear Pearson correlation coefficient between the 100-200 keV band and 

the annihilation flux of V404 Cygni during MJD 57191 and 57203 in hourly time 
bins is 0.45 for zero lag, and 0.34 for a lag of ~15h. 
FERMI/LAT data analysis. We have analysed the all-sky survey data of the 
FERMI/LAT instrument taken during the 15.5-h time window when we see the 
largest positron flare, in epoch 3 at orbit 1557, corresponding to MJD 57199.616 
to 57200.261. The Pass8 data of a 20° circle around V404 Cygni in the energy 
range 100 MeV to 10 GeV have been retrieved from the Fermi Science Support 
Center (http://fermi.gsfc.nasa.gov/cgi-bin/ssc/LAT/LATDataQuery.cgi; accessed 
6 August 2015). We employed the unbinned likelihood analysis as implemented 
in the user-contributed LATAnalysisScripts (http://fermi.gsfc.nasa.gov/ssc/data/ 
analysis/scitools/LATAnalysisScripts.html; accessed 6 August 2015) provided 
by FSSC, after changing to Pass8 details. Standard event cleaning/removal was 
applied (http://fermi.gsfc.nasa.gov/ssc/data/analysis/documentation/Cicerone/ 
Cicerone_Data_Exploration/Data_preparation. html; accessed 6 August 2015). 
We used the gll_iem_v06.fits Galactic interstellar emission model, and the 
iso_P8R2_SOURCE_V6_v06.txt isotropic spectral template for the SOURCE 
event class (front+-back). The Cygnus Loop, Cygnus Cocoon, 1 Cyg and HB 
21 have been included in the fitting, as well as bright 3FGL sources. No source 
is detected at the position of V404 Cygni. We derive upper limits of 8 x 1077 
photonscm~s~! in the 100 MeV to 1 GeV band, and 3 x 10°? photonscm 7s"! 
in the 1-10 GeV band. 
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arb. units 


100 1000 10000 
Energy [keV] 


Extended Data Figure 1 | Spectral shape of annihilation emission from 
a relativistic thermal pair plasma. Each curve shows intensity per unit 
energy and is labelled with 0 (= kT/m,c’), the dimensionless temperature. 
The model is used to quantify the width of the observed annihilation 

line (see Methods section ‘Spectral fitting’). The grey shaded area is the 
integrated flux over all energies for this thermal plasma annihilation 
model (TPA). 
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Extended Data Table 1 | Spectral fit parameters for the flaring epochs of V404 Cygni 


: fer ey Ec Er ren B Ey roe kT rpa 
Orbit arsteel [keV] [keV] arse [keV] a | [keV] 
1554 —-2.31(11) -1.44(5)  40(2),—s«91(5) ~—S«2.18(59) -3.89(9) 220(113) 1.9(1.2)  29(44) 
1555 —-2.91(16) -2.18(5)  42(1) ~—86(3) 1.48(3) -3.38(4) 95(12) 6.5(1.6)  173(46) 
1557 7.30(5) -2.08(1)  40(1) = 99(1) 4.088) -3.30(2) 122(7) 1.2(9) 2(1) 


These epochs are shown in Fig. 1. Uncertainties are given in brackets, in units of the last digit. The redshift seen in orbit 1557 is 0.101(1). The amplitudes Ap and Bo are normalized to the flux at 
100 keV. For nomenclature, see Methods section ‘Spectral fitting’. 
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Extended Data Table 2 | Goodness-of-fit for spectra 


Orbit 1554 1555 1557 
x°(M1) / dof (total spec.) 63.7 / 38 69.2/38  280.9/37 
x°(M1) / dof (E < 150 keV) 49.2 / 24 56.2/24  265.7/24 
x°(M1) / dof (E > 150 keV) 14.5/14 13.0/14  15.2/13 
x°(M2) / dof (E > 150 keV) 12.5/16 25.6/16  30.1/16 


The goodness-of-fit for spectra corresponding to epochs shown in Fig. 1 is measured by \2/d.o.f. 
(d.o.f., degrees of freedom). We illustrate systematics by giving results for different energy regions, 
and for different models (see Methods section ‘Spectral fitting’). Model 1 (M1) is the full thermal 
pair annihilation model, and model 2 (M2) is a high-energy cut-off power-law model for Compton- 
ized disk emission plus another independent power law capturing accelerated particle emission. 
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Extended Data Table 3 | Spectral fit result details for the alternative model M2 


Orbit (Model) 10 3bh a Ec Era 10 Sh a 
cm? s keV : Rad Load cm? s keV : 
1554 (M2) 2.26(15) -1,36(3) 40(2) 79(4) 0.14(6) -1,94(33) 
1555 (M2) 2.58(24) -2.16(6) 42(1) 70(3) 0.37(3) -2.27(76) 
1557 (M2) 7.38(25) -1,50(19) 38(1) 50(10) 1.86(43) -2,62(20) 


See also Extended Data Table 2. Model 2 (M2) is as in Extended Data Table 2. The amplitudes Ao; and Aoz are normalized to the flux at 100 keV. For nomenclature, see Methods section ‘Spectral fitting’. 
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Extended Data Table 4 | Significance estimates of additional components 


Orbit 1554 1555 1557 Total 


Ay? / Significance [o] 5.7/2.4 80.3/9.0 250.8/15.8  336.8/18.3 


Axi / Significance [o] 3.5/1.9 12.4/3.5 8.2/2.9 24.1/4.9 


Ax? describes the improvement of an additional high-energy extension over a high-energy cut-off power-law 
model only, and AxS of an additional thermal pair annihilation feature over the model combining a high-energy 
cut-off power-law model and its high-energy power-law extension. Significances have been calculated by 
?-tests with one additional component each. For nomenclature, see Methods section ‘Spectral fitting’. 
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Extended Data Table 5 | Spectral fit parameters for the third flaring epoch 


Bo Ippa Fs 11 
Ex kT rpa E centroid k Trea 


Orbit 
1557a 4.27(42) -3.4(1) <200 1.2(1.0) 511(47) 1.5(0.5) 458(25) 4(3) 
1557b 5.73(1.20) -3.8(3) <200 3.3(2.9) 489(26) 5.3(1.9) 466(-3/+45) - 


The energy region between 200 keV and 2,000 keV for orbit number 1557 is shown. In addition to a hot thermal annihilation plasma with temperatures around 500 keV, components are either another, 
Doppler-shifted, thermal annihilation line (1557a), or an ortho-positronium continuum (1557b). The amplitude Bo is normalized to the flux at 100 keV. The y? goodness-of-fit values are 9.3 and 5.9 
for 9 and 8 d.of., respectively. For nomenclature, see Methods section ‘Spectral fitting’. 
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Late Tharsis formation and implications for 


early Mars 


Sylvain Bouley!?, David Baratoux*, Isamu Matsuyama’, Francois Forget®, Antoine Séjourné!, Martin Turbet® & Francois Costard! 


The Tharsis region is the largest volcanic complex on Mars and 
in the Solar System. Young lava flows cover its surface (from the 
Amazonian period, less than 3 billion years ago) but its growth 
started during the Noachian era (more than 3.7 billion years ago). 
Its position has induced a reorientation of the planet with respect 
to its spin axis (true polar wander, TPW), which is responsible 
for the present equatorial position of the volcanic province. It has 
been suggested that the Tharsis load on the lithosphere influenced 
the orientation of the Noachian/Early Hesperian (more than 
3.5 billion years ago) valley networks! and therefore that most of the 
topography of Tharsis was completed before fluvial incision. Here we 
calculate the rotational figure of Mars (that is, its equilibrium shape) 
and its surface topography before Tharsis formed, when the spin axis 
of the planet was controlled by the difference in elevation between 
the northern and southern hemispheres (hemispheric dichotomy). 
We show that the observed directions of valley networks are also 
consistent with topographic gradients in this configuration and thus 
do not require the presence of the Tharsis load. Furthermore, the 
distribution of the valleys along a small circle tilted with respect 
to the equator is found to correspond to a southern-hemisphere 
latitudinal band in the pre-TPW geographical frame. Preferential 
accumulation of ice or water in a south tropical band is predicted 
by climate model simulations of early Mars applied to the pre-TPW 
topography. A late growth of Tharsis, contemporaneous with valley 
incision, has several implications for the early geological history 
of Mars, including the existence of glacial environments near the 
locations of the pre-TPW poles of rotation, and a possible link 
between volcanic outgassing from Tharsis and the stability of liquid 
water at the surface of Mars. 

The Tharsis bulge is the largest volcano-tectonic centre on Mars. Its 
growth started during the Noachian epoch (>3.7 billion years ago)? 
and the associated enormous transfer of mass, energy and release of 
volatiles from the mantle had implications for the planet's evolution, 
including its climate, surface environment and mantle dynamics. 
The earliest signs of activity are limited to Noachian extensional tec- 
tonics observed around Claritas Fossae in the ancient Syria Planum, 
Thaumasia and Tempe Terra regions”. Assuming a 100-km-thick elastic 
spherical shell, models of the topographic effect of the Tharsis load have 
suggested that most of the bulge was largely in place by the end of the 
Noachian epoch. If so, it could have influenced the orientation of valley 
networks!. However, the elastic lithosphere thickness at the Noachian 
according to modern estimates® (<30km) corresponds to weak support 
of the Tharsis load by lithospheric flexure. In fact, early completion of 
the Tharsis bulge is questionable in light of recent studies: the formation 
of radial and concentric structures around Tharsis (including Valles 
Marineris) indicates complex and multi-stage growth of the bulge from 
the late Noachian to the Amazonian7; the inferred crust thickness at 
the time of formation of the Coracis Fossae rift is evidence of late 


(<3.5 billion years ago) crustal growth in the Tharsis region*; and 
tectonic structures initially assigned to the Noachian era” eventually 
also affect Hesperian units (Extended Data Fig. 1). Thaumasia and 
Claritas Fossae are the only Noachian units of Tharsis above the average 
elevation of the southern hemisphere of Mars (Extended Data Fig. 1). 
However, these regions at the southern edge of Tharsis have remnant 
crustal magnetization, in contrast with the rest of the magmatic prov- 
ince, and their formation has been related to a pre-Tharsis orogenic 
event®. 

The rotation axis orientation of a planet is constrained by its 
degree-2 gravity field, which is directly related to non-spherical con- 
tributions to the inertia tensor. Before Tharsis, the hemispheric dichot- 
omy of the topography controlled the orientation of the planet relative 
to its spin axis®. The growth of Tharsis must have induced a reorien- 
tation of the planet with respect to its spin axis (TPW) moving 
the mass anomaly to its current position close to the equator®". 
The rotation poles before Tharsis’ formation and TPW (palaeo poles) 
can be constrained by removing the Tharsis and remnant rotational 
figure contributions to the degree-2 gravity field. This yields a 
palaeo north pole at (100.5° + 49.5° W, igang: N) and a palaeo 
south pole at (79.5° + 49.5°E, fii S)!°, near the centre of the 
crustal dichotomy (80° E, 60° S)’. 

In addition to a rotation of the geographic reference, TPW induces 
an adjustment of the topography as a result of a new equilibrium rota- 
tional figure’®. A large TPW event (20°-25°) such as the one triggered 
by the formation of Tharsis may have had global implications. For 
instance, the distribution of surface landforms influenced by local 
slopes or early Mars climate simulations should be calculated within 
a reference frame corresponding to the planet’s configuration at the 
time of the formation of such landforms. Valley networks, incised dur- 
ing an early climate distinct from the present climate, are mainly 
located on the highlands!” within a domain of latitudes ranging from 
—60° S to +30° N (Fig. 1a). In the present reference frame, the density 
and spread in latitude of Early Hesperian/Noachian valley networks 
show large variations with longitude!”. The valley network distribution 
is dominated by local or regional slopes rather than by elevations'”"?. 
However, no combination of topographic parameters is able to explain 
the lack of valley networks between 30° S and 60° S from the Argyre 
basin to the east of the Hellas basin and the very low density in Arabia 
Terra (Fig. 1). In fact, the valley networks occur in a band that appears 
to follow a small circle tilted with respect to the present equator. We 
have quantitatively confirmed this observation by least-squares adjust- 
ment of the valley network density map to a small circle (Methods). 
The normal vector of the plane containing the small circle defines a 
north pole located north of Tharsis along the meridian passing 
through the centre of the Tharsis bulge (118° W, 69° N) and a south 
pole south of the Hellas basin (62° E, 69° S). This rotation axis is con- 
sistent with the calculated palaeo north pole location before the 
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Toulouse, France. “Institut de Recherche pour le Développement et Institut Fondamental d'Afrique Noire, Dakar, Sénégal. “Lunar and Planetary Laboratory, University of Arizona, Tucson, Arizona 
85721, USA. ®Laboratoire de Meétéorologie Dynamique, Institut Pierre Simon Laplace, CNRS, Université Pierre et Marie Curie, Paris, France. 
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Figure 1 | Noachian/Early Hesperian valley networks distribution and 
density!” before and after TPW. a, In the present reference frame. The 
palaeo pole positions are indicated with a diamond, with the spread in 
longitude and latitude for solutions shaded from black (14°) to red (15°), 
corresponding to the associated root-mean-square value for each solution, 


formation of Tharsis and TPW (100.5° +49.5° W, 71.1°117'5: N; ref. 10). 
The proximity of the palaeo north pole inferred from the valley net- 
work distribution and the one inferred from degree-2 spherical har- 
monic gravity without Tharsis’° is remarkable because these results 
are independent. 

In the pre-TPW configuration, the valley network distribution 
defines a latitudinal band of +14° centred at 24° S, that is, in the 
south tropical regions (Fig. 1b). The incision of valley networks 
spans the Hesperian—Noachian (>3.5 billion years ago) period'*5 
and it is unclear whether the valley network formation or the TPW 
event caused by the Tharsis rise occurred first. To investigate whether 
the Tharsis bulge controlled the direction of the valley network, we 
modelled the stream network in a latitudinal band (from 40° S to the 
dichotomy) in the pre-TPW reference frame for a topography without 
Tharsis and before TPW at 1° per pixel (available in the Supplementary 
Information; see also Methods) and for a present-day topography with 
Tharsis at the same resolution. The directions of large-scale valley 
networks in the pre-TPW and in the present configuration are both 
compatible with the observed directions of valley network (Extended 
Data Figs 1 and 2). In both configurations valley networks are oriented 
towards the north, reflecting the topographic dichotomy of the Martian 
surface. This result indicates that the presence of the Tharsis bulge is 
not necessary to explain their orientation. However, the occurrence of 
valley networks within a palaeo tropical band, between the equator and 
40° S, subject to precipitation (ice, snow or rainfall), supports a post- 
incision Tharsis-driven TPW during the Early/Late Hesperian period. 


o° 45°E 
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135° E 180° 
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as given by equation (16) (see Methods). The white dashed line is the 
pre-TPW equator. b, In the pre-TPW configuration. Valley networks occur 
within a latitudinal band of +14° centred at 24° S. The black dashed line is 
the present equator. 


Early Mars climate simulations!*!* assuming the present topogra- 
'Yy g Pp Pog, 


phy predict patchy ice/water accumulation in a south tropical band for 
a cold/icy scenario'”!® and precipitation around Hellas basin and in 
Tharsis region for warm/wet conditions'®. Both kinds of simulation fail 
to explain the occurrence of valley networks down to 45° S at the east 
of Hellas and down to 60° S at the south of Tharsis. They also predict 
substantial rain/snowfall in the west of Tharsis. This prediction does 
not match the lack of valley networks in this region. We performed 
new simulations using the same model and conditions (see Methods) 
using a pre-TPW topography. We found that ice tends to stabilize and 
accumulate in a tropical band (Fig. 2) similar to the distribution of 
valley networks (Fig. 1b), as a result of enhanced precipitation induced 
by adiabatic cooling when the atmospheric circulation transports water 
vapour southward, up to the highlands. An icy patch is also predicted in 
the Tharsis region but recent volcanic flows preclude the preservation 
of ancient morphologies in this region. Intensity of drainage is affected 
by several factors and is not only related to the intensity of precipita- 
tion. The geological history subsequent to valley network formation 
may also be responsible for heterogeneous modifications of the palaeo 
drainage intensities. 

Can we find any geological clues of a past polar climate at the location 
of the palaeo poles? The palaeo north pole is located in Scandia Colles 
(Extended Data Fig. 4), a knobby terrain that probably represents one of 
the rare Noachian units in the north polar region”. Interestingly, some 
of the landforms of this region have been attributed to Late Hesperian 
polar ice retreat or melting*®?!, which could be the result of the 
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Figure 2 | Permanent ice deposits predicted by the global climate 
model for early Mars, with obliquity 45°, a circular orbit and mean 
surface pressure ~0.2 bar. a, With present-day topography. b, With 
topography before TPW, without the Tharsis bulge. These maps show 
the yearly minimum of surface ice in simulations with just enough ice 

in the system to simulate the location where the ice stabilizes under a 
specific configuration. Elevations are given in kilometres, with solid lines 
representing isoaltitudes equal to or higher than 0 km and dashed lines 
representing isoaltitudes lower than 0 km. 
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displacement of the pole at this time. This palaeo north pole is located 
within the unique region of the northern plains where the SHARAD 
(Shallow Subsurface Radar) instrument has detected the base of ground 
ice at a depth of several tens of metres” that could correspond to the 
remnant of an ancient ice cap. Similarly, the southern hemisphere radar 
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Figure 3 | Scenario for a TPW driven by a late growth of Tharsis 
contemporaneous with valley network incision. a, During the Early 
Noachian period, the planet was initially oriented such that the dichotomy 
was parallel to the equator®. b, Precipitation occurred in a tropical 
band during the Noachian and the Early Hesperian periods and was 
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interface detected in the Dorsa Argentea region has been attributed to 
the remnant of an Hesperian-age buried ice cap”*. The palaeo north 
pole is also surrounded by an area of anomalous neutron counting 
rates” associated with high hydrogen content (indicating water) in 
the first metre below the surface (Extended Data Fig. 5). Although the 
formation of this reservoir is likely to be recent, it may be related to 
the slow sublimation of deeper and older ice reservoirs favouring the 
stability of ice near the surface. 

The palaeo south pole is located in the Noachian volcanic unit, 
Malea Planum. Some deposits in this region (Pityusa Patera) are pos- 
sible remnants of an extensive polar ice sheet emplaced in the late 
Noachian to mid-Hesperian”’. Erosional lineated terrains” and faintly 
dentritic Hesperian valley networks (Axius Valles and Mad Vallis)”° 
are other features expected in a context of erosion by meltwater 
associated with retreating ice sheets”*?”. A higher concentration of 
double-layered ejecta in this unit”® also suggests the presence of an ice- 
rich substratum**”? that might be related to the polar palaeo location 
of this unit. 

In summary, we consider that the planet was initially oriented such 
that the dichotomy was parallel to the equator (Fig. 3a) during the 
Noachian and the Early Hesperian periods®. Precipitation occurred in 
a tropical band during these periods. The incision of valley networks 
was contemporaneous with an initial phase of magmato-tectonic activ- 
ity at Tharsis (Fig. 3b). The prolonged magmatic activity during the 
Hesperian moved Tharsis close to the present equator after the end of 
most of the fluvial activity (Fig. 3c). This TPW event may have gener- 
ated stresses that were large enough to produce a global tectonic pattern 
(Extended Data Fig. 6). Global stress and tectonic pattern calculations 
show that extensional features oriented roughly north-south should 
be observed around the palaeo poles”. These fault patterns are not 
observed’. However, the identification of the expected global tectonic 
pattern may be complicated by additional stresses generated by Tharsis 
loading or the possibility of stress relaxation”®. Stress relaxation may 
arise as a result of the expected slow TPW speed of about 1° per million 
years?) 

Causal links between volcanic outgassing, build-up of atmospheric 
pressure and precipitation have already been considered, but the valley 
networks were thought to have formed after the emplacement of most 
of the Tharsis load**. In light of our results, we conclude that precipita- 
tion occurred during the birth and growth of the Tharsis bulge rather 
than when its activity was declining. Such a scenario is more plausi- 
ble than the post-Tharsis formation of valley networks, considering 
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Tharsis formation causes a TPW, 
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contemporaneous with the growth of Tharsis. c, The prolonged magmatic 
activity at Tharsis during the Hesperian period drove a slow TPW and 
moved Tharsis close to the present equator after the end of most of the 
fluvial activity. 
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early degassing and progressive depletion of volatiles in the mantle 
source. The calculated pre-Tharsis topographic map of Mars provides 
a framework within which to examine the first billion years of its 
geological history. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Topography model. The topography without Tharsis and rotational deformation 
due to TPW was calculated using gravity and topography data’. We adopt the 
expected rotation pole location before the emplacement of Tharsis (259.5° E, 71.1° 
N), the expected elastic lithosphere thickness at the time of loading (58km), and 
the expected Tharsis gravity and shape coefficients (tables 3-5 in ref. 10). 

The gravitational potential external to Mars, at a point with spherical coordi- 
nates (r, 6, b), where @ is co-latitude and ¢ is longitude, can be expanded in spher- 
ical harmonics with unnormalized expansion coefficients Cj, and Sj, as follows 
(see, for example, refs 34 and 35): 


co ol 1 
U(r, 0,0) “ a> (| 
1=2 m=0 
X Pim(cos0) [Cimcos(md) + Sincos(m¢)] (1) 


where G is the gravitational constant, M and R are the mass and mean radius of 
Mars, and P,,, are unnormalized associated Legendre functions given by: 


Pin it) = (1 = 12"? Py) (2) 


un 
where P; are Legendre polynomials**°. We do not include the Condon-Shortey 
phase factor of (—1)”. The unnormalized coefficients are related to the normalized 
expansion coefficients, Ci, and Sim: 


[sc] 
Sim 
where 6jn9 is the Kronecker delta function. 


We expand the geoid, N, and shape, S, in spherical harmonics with unnormal- 
ized expansion coefficients as: 


(l+m)! 
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where the equipotential is chosen to be the mean potential at the equator, and: 


co 1 
S= > YO Pim(Cosé) [cimcos(m@) + simsin(md)] (5) 


l=2 m=0 


where Cj», and sj, are the shape expansion coefficients. We use the Jet Propulsion 
Laboratory Mars gravity field MRO95A*’ and the Mars Orbiting Laser Altimetry 
(MOLA) shape model**. The topography is given by T=S—N. 

We remove rotational deformation contributions associated with the change 
in the rotational potential due to TPW. Given the location of the rotation pole, 
(Og, Or), the rotational potential expansion coefficients are: 


Com(On bp) 
SCOR. op) 


2p3 
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= 2 Sno) 2 sin(m@z) 


(6) 


where 2 is the rotation rate. The gravity expansion coefficients associated with 
rotational deformation due to TPW can be written as: 


Cr |, |Cim(One Pre) 7 Cn (OR Opi) : 
tTew| | oR Oe - ~ 2! oR De: (7) 
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where ky is the degree-2 tidal Love number describing the long-term gravity defor- 
mation due to the change in the rotational potential, and (Oz, drys) and (Axi, dri) 
are the final (present) and initial (before TPW) spherical coordinates of the rota- 
tion pole respectively. Similarly, the shape expansion coefficients associated with 
rotational deformation due to TPW can be written as: 


cr], |CimlOne Pre) ; Cin Oni Pri) 
Tew| 7} oR 4 ENA) op 6 (8) 
Sam Soiml Rf Pre) Soiml Ris Ppi) 


where hy is the degree-2 tidal displacement Love number describing the long-term 
shape deformation due to the change in the rotational potential. 

The dimensionless ky and hy Love numbers depend on Mars interior struc- 
ture and rheology. We assume the five-layer internal structure model described 
in table 2 of ref. 10, and use the classical propagator matrix method (for example, 
ref. 39) to calculate their values. For the expected elastic lithosphere thickness of 
58km, k2=1.10 and hy = 2.00. These Love numbers are not sensitive to elastic 
lithosphere thickness. For example, kz = 1.19 and hy) =2.19 for a model without 
an elastic lithosphere. 

We compute the expansion coefficients for the topography without Tharsis and 

the rotational deformation associated with TPW by removing these contributions 
from the observed gravity and shape coefficients. The Tharsis gravity and shape 
coefficients are taken from tables 3-5 of ref. 10, and the gravity and shape coeffi- 
cients for the rotational deformation associated with TPW are given by equations 
(6) and (7). Finally, we compute the spherical harmonic coefficients in the pre- 
TPW frame using Wigner-D functions. 
Determination of the palaeo poles. The palaeo pole positions are calculated from 
a least-squares adjustment of the valley network density to a small circle on the 
sphere. (A small circle is given by the intersection of the sphere with a plane.) 
Considering the plane equation in Cartesian coordinates: 


z=ax+ byte (9) 
and the conversion equation of spherical coordinates (r, A, y) into Cartesian coor- 


dinates, where r is the average radius of Mars, \ is the longitude, and y the latitude: 


x=rcosAcosp 
y=rsinAcosp (10) 
z=rsiny 


the equation of a small circle is given by: 


tang, = acosAsc + bsinAse + 
rcosy,. 


(11) 


This equation may be solved with u=cos((sc) and by extracting the roots of the 
second-degree polynomial in u. We thus have: 


ALIA 


a ee 
2[A? + 1] 


Leo = arccos (12) 


where: 
A = acosrgc + bsinAse 


(13) 
and: 

(14) 
The parameters a, b and c are determined by minimizing of the sum of the weighted 
residuals between calculated latitudes and observed latitudes of the valley network 


density map: 


v=D laa) — oP? 


(15) 


where Aj; and y; are coordinates of the valley network density map and dj is the 
corresponding density of valley networks'*. Adjustment is achieved by direct 
exploration of the three-dimensional parameter space. This approach is useful 
to determine the best solution corresponding to the minimum of v, but also to 
determine the spread of solutions (a, b, c) corresponding to any chosen residual 
greater than the minimum value. The residual may be expressed as a root mean 
square (r.m.s.) in latitude using: 


Vmin 
r.m.s. = > 
4 > di 

i 


The palaeo pole positions are then given from parameters a and b and the following 
equations: 


(16) 


1 


Poole = 90 — arccos 


pole = arctan| ?) 
a 
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The best solution is found at 24° S and corresponds to a r.m.s. in latitude of 14° 
that is equal to the spread of valley networks in a direction perpendicular to 
the small circle. The best palaeo pole positions are indicated with a diamond in 
Fig. 1a, together with the spread in longitude and latitude. Colours from black to 
red correspond to the associated r.m.s. value for each solution, as given by equation 
(16), and vary from 14° (black) to 15° (red). 

Determination of the stream network before and after Tharsis emplacement. 
The relief controls the direction of runoff processes. To investigate whether the 
emplacement of the Tharsis bulge controlled the direction of the valley network, 
the stream network was modelled for a topography of Mars without Tharsis and 
for a topography with Tharsis with the same resolution of 1° per pixel. We used 
the Arc Hydro tool in ArcGIS (http://downloads.esri.com/archydro/archydro/) 
that includes different functions to extract hydrological parameters (that is, the 
flow direction, the flow accumulation and the stream definition*®). The digital 
elevation model (DEM) can contain artefacts of DEM construction, but these have 
been corrected. 

The flow direction is based exclusively on topography. For each cell, the flow 
direction corresponds to the direction of the steepest slope between the cell and 
the eight neighbouring cells. The result is a raster with the value of every cell cor- 
responding to one of the eight possible flow directions. Then, based on the flow 
direction raster, for each cell, the flow accumulation is calculated as the total num- 
ber of cells drained upstream of each cell. The flow accumulation approximately 
represents the drainage network. 

Finally, the stream network is defined, based on the flow accumulation network 
and on a user-defined river threshold value (Extended Data Fig. 2). For each cell, if 
the value is greater than this threshold, it is defined as a stream. A smaller threshold 
will result in a denser stream network and usually in a greater number of delineated 
catchments, which may hinder delineation performance. For both DEMs, in order 
to achieve general flow directions, which are not meant to correspond directly to 
the actual valley network, we chose a river threshold value of 15 cells. 

The orientation of the stream network was calculated by using an ArcGIS tool"). 
This orientation was calculated with respect to the north (azimuth). The data were 
compiled in a rose diagram representing the orientation of the stream network for 
the DEM without Tharsis (total number of measurements N = 702; Extended Data 
Fig. 3a) and with Tharsis (N = 698; Extended Data Fig. 3b). 

The distributions of the stream network for both topographies (before and after 
Tharsis emplacement and TPW) are similar (Extended Data Fig. 2). The stream 
network is mainly oriented towards the north in both configurations (Extended 
Data Fig. 3). The calculation shows that the flow direction in the highlands was 
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primarily controlled by the dichotomy and limits the possible influence of the 
Tharsis bulge. 

Global climate model simulations. We use the Laboratoire de Météorologie 
Dynamique Early Mars 3D global climate model'®!”. It includes the modelling 
of the CO) cycle (condensation and sublimation on the surface and in the atmos- 
phere), a water cycle (transport of water vapour and clouds, precipitation and 
evaporation) and a detailed radiative transfer code adapted to a thick CO2 atmos- 
phere and both CO, and H30 clouds. Here we used the ice equilibration algorithm 
from ref. 17 designed to calculate the location where the ice deposits stabilize 
at equilibrium under early Mars conditions, after the equivalent of thousands of 
years of evolution. This algorithm was shown to be insensitive of the assumed 
initial state and in particular to the location of the ice reservoir at the beginning 
of the simulation. 

Figure 2 presents a map of the permanent ice deposits predicted for the 
most likely conditions for early Mars, that is, with an obliquity of 45° (ref. 42) and 
a mean pressure of 0.2 bar of CO; (ref. 16). Similar results were obtained with a 
mean pressure of 1 bar. 
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Extended Data Figure 1 | Map of Tharsis region with 0 m, 3,000 m and 6,000 m isoaltitude lines. Noachian terrains are mapped in light red for terrains 
lower than 3,000 m and in dark red for terrains higher than 3,000 m. Hesperian and Amazonian terrains are in grey. Age units are taken from the most 
recent geological map of this region*’. The black cross on Tharsis Montes is the location of the centre of mass of the Tharsis dome. 
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Extended Data Figure 2 | Modelled stream network before and after Tharsis emplacement. a, b, Digital Elevation Model (DEM) with 1° per pixel 
resolution without Tharsis (a) and with Tharsis (b). The stream network was modelled using the Arc Hydro tool in ArcGIS. 
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Extended Data Figure 4 | Geological map of the north polar region. The red cross indicates the location of the palaeo north pole (PNP), inferred from 
the valley network distribution. Figure modified from ref. 43; US Geological Survey. 
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Extended Data Figure 5 | Orthographic projection of lower-limit concentrations of water abundance at latitudes poleward of 50° N. The red cross 
indicates the location of the palaeo north pole, inferred from the valley network distribution. Figure modified with permission from figure 5 of Feldman, 
W.C. et al.”4, J. Geophys. Res., John Wiley and Sons, copyright 2004 by the American Geophysical Union. 
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Extended Data Figure 6 | Predicted global-scale stress and tectonic In the tectonic pattern (b), the orange, blue, and light grey lines indicate 
patterns due to the Tharsis-driven TPW event. Solid circles indicate the strike of the expected normal, thrust and strike-slip faults, respectively. 
the locations of the palaeo poles. In the stress pattern (a), crosses indicate Contours correspond to the deviator stress in units of MPa. Solid black 
directions and relative magnitudes of principal stresses, and orange and lines mark the boundaries between different tectonic regions. 


blue lines correspond to extensional and compressive stresses, respectively. 
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Enhancing coherence in molecular spin qubits via 


atomic clock transitions 


Muhandis Shiddiq!*, Dorsa Komijani!*, Yan Duan?, Alejandro Gaita-Arifio2, Eugenio Coronado? & Stephen Hill! 


Quantum computing is an emerging area within the information 
sciences revolving around the concept of quantum bits (qubits). 
A major obstacle is the extreme fragility of these qubits due 
to interactions with their environment that destroy their 
quantumness. This phenomenon, known as decoherence, is of 
fundamental interest!”. There are many competing candidates 
for qubits, including superconducting circuits’, quantum optical 
cavities*, ultracold atoms® and spin qubits®*, and each has 
its strengths and weaknesses. When dealing with spin qubits, 
the strongest source of decoherence is the magnetic dipolar 
interaction’. To minimize it, spins are typically diluted in a 
diamagnetic matrix. For example, this dilution can be taken to 
the extreme of a single phosphorus atom in silicon®, whereas 
in molecular matrices a typical ratio is one magnetic molecule 
per 10,000 matrix molecules!°. However, there is a fundamental 
contradiction between reducing decoherence by dilution and 
allowing quantum operations via the interaction between spin 
qubits. To resolve this contradiction, the design and engineering 
of quantum hardware can benefit from a ‘bottom-up’ approach 
whereby the electronic structure of magnetic molecules is 
chemically tailored to give the desired physical behaviour. Here 
we present a way of enhancing coherence in solid-state molecular 
spin qubits without resorting to extreme dilution. It is based on 
the design of molecular structures with crystal field ground states 
possessing large tunnelling gaps that give rise to optimal operating 
points, or atomic clock transitions, at which the quantum spin 
dynamics become protected against dipolar decoherence. This 
approach is illustrated with a holmium molecular nanomagnet in 
which long coherence times (up to 8.4 microseconds at 5 kelvin) are 
obtained at unusually high concentrations. This finding opens new 
avenues for quantum computing based on molecular spin qubits. 

One of the proposed approaches to obtaining spin qubits is that of 
using magnetic molecules*"!°. Up to now, coherence has been opti- 
mized through dilution and deuteration to minimize dipolar and 
hyperfine interactions, respectively? "'+'>!% A class of molecules in 
which these two sources of decoherence can be minimized by alter- 
native means are the so-called polyoxometalates. In the past, these 
metal oxide clusters have been used as model systems in molecular 
magnetism because of their ability to host magnetic ions in chemically 
tailored environments of high symmetry and rigidity'’. Currently, 
these molecules are seen as potential building blocks in quantum 
computing architectures'***, 

In the present study we chose the [Ho(W5Qj)2]? complex (abbre- 
viated HoWjo), which has been subjected to extensive structural, 
magnetic and spectroscopic characterizations that raised the possibil- 
ity of observing coherent spin dynamics***4. HoW 1 is formed by 
two molecular tungsten oxide moieties encapsulating a Ho*+ ion 
(Fig. 1). The geometry around Ho** exhibits a slightly distorted 
square-antiprismatic environment, which can be approximated by a 


Dag ‘pseudo-axial’ symmetry. This results in a splitting of the total 
angular momentum, J =8, ground state spin-orbit manifold according 
to its my quantum numbers. Quantitatively this splitting can be 
described by the spin Hamiltonian in equation (1), where the double 
summation parameterizes the crystal-field (CF) interaction which, for 
D4q symmetry, contains the axial terms B9O8, B9O9 and B°O? (see 
Methods for definition and discussion of terms in equation (1)**) 


k 
H= \> BI01+f-A-T+ppBo-g,-f—HygyBo-f (1) 
k=2,4,6 q=0 


This results in an isolated mj= +4 ground doublet, separated from the 
first excited states (mj= +5) by ~20cm |. This picture provides a 
reasonable description of the magnetic properties of this molecule”’. 
However, minor deviations from D4q symmetry that are present in the 
crystal make operative the tetragonal B40} CF interaction. 
Interestingly, the match between the (-tinteger) values of the ground 
state spin projections, m;= +4, and the tetragonal (that is, q=4) order 


of the main symmetry axis of the molecule results in the B404 
l 


(here Of = (4 +J*)) interaction generating an unusually large 
quantum tunnelling gap, A + 9.18 GHz (~0.3cm~!)™4. This gap is a 
crucial factor for the coherence of electron spin dynamics in molecular 
spin qubits, and is the main subject of the present study. 

The standard approach to probing coherent spin dynamics involves 
the use of electron spin echoes in pulsed electron paramagnetic reso- 
nance (EPR). The HoW jy system is attractive in this regard because 
its predicted tunnelling gap (~9.18 GHz, Fig. 1) is close to the X-band 
frequency associated with the most sophisticated EPR spectrometers. 
While the magnitude of the gap is set by Bj, interesting details of the 
EPR spectra are also determined by the hyperfine interaction between 
the Ho electron and nuclear spins (the second term in equation (1)). 
Holmium occurs naturally in only one stable isotope (1*Ho) with a 
nuclear spin of [= 7/2. A strong hyperfine coupling (Aj = 830+ 
10 MHz) results in the observation of eight (21+ 1) well-resolved 
transitions via continuous-wave (CW) high-frequency EPR measure- 
ments”. The energy level scheme that arises from the combination 
of CF and hyperfine coupling, together with the Zeeman interaction 
(third and fourth terms in equation (1)), gives rise to a series of 
avoided level crossings between m;= +4 states (with the same m)), 
resulting in multiple gaps in the energy diagram near zero field 
(Fig. la). 

Single crystals of Nag[Ho,Y (1-x)(WsO1s)2]-1H2O (where Y (yttrium) 
is non-magnetic) were prepared with Ho concentrations ranging from 
x =0.25 to x=0.001, that is, up to three orders of magnitude away 
from the usual high-dilution limit’®, allowing a study of the effects of 
dilution on electron dipolar spin-spin decoherence. Figure 2a displays 
electron-spin-echo- (ESE-) detected EPR spectra recorded at 5 K for 
a dilute (x= 0.001) sample at frequencies from 9.1 GHz to 9.8 GHz, 
with 6 = 29° (6 is the angle between the applied field, Bo, and the z axis 
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Figure 1 | HoWj tunnelling gap. a, Main figure, Zeeman diagrams for 


the mj= +4, I=7/2 ground state, with Bollz: the thin grey lines assume 
exact Dyq symmetry, while the thick black curves assume an ‘axial + B{O? 
parameterization”. Numbers on the top and right axes indicate m, values. 
Inset (blue trace at bottom), the corresponding 9.64 GHz CW EPR 
spectrum (reproduced from figure 8 of ref. 24 with permission from the 
Royal Society of Chemistry) is observed well below the D4q prediction 
(grey arrow denotes expected highest field resonance), providing evidence 
for the tunnelling gap. Indeed, the ‘axial + B{O?’ parameterization gives 
excellent agreement with the data, both in terms of resonance positions 
(blue arrows) and intensity (arrow thickness). The red vertical lines, 
meanwhile, indicate the locations of CTs. b, Main figure, 3D EPR intensity 
map including inhomogeneous broadening due to a Gaussian distribution 
in BY (opa4=2.1 x 10-° cm™!); darker shading represents stronger 
intensity, with contours at the to, (red) and +20, (blue) levels 

(74 = 123 MHz, the s.d. in A). Red arrows denote CTs and dashed lines 
denote locations of 9.64 GHz resonances. Inset, the HoWj9 molecule. 


of the crystal); ESE signals were generated using a two-pulse Hahn- 
echo sequence (see Methods)”. Four broad peaks of equal intensity 
are observed at the two lowest frequencies (9.11 GHz and 9.16 GHz), 
which were selected to be close to the gap minima in Fig. 1b. With 
increasing frequency, these peaks split and move symmetrically apart, 
as expected on the basis of predictions in Fig. 1b. For the most part, 
the data lie on the simulated curves, with the obvious exception of the 
two lowest frequencies and some lower-field (<60 mT) data points. 
The simulations are based on previously determined Hamiltonian 
parameters”, and the spectra are plotted against the re-scaled longi- 
tudinal applied field, Bo, (=Bocos6), to facilitate comparisons between 
different samples (see Methods). 

Two-pulse ESE measurements were separately used to determine 5 K 
transverse relaxation times, T, at selected points within the spectrum 
for the x =0.001 concentration. The longest values of T, are found in 
the vicinity of the gap minima (at Boz = Bmin) for the smallest crystals 
(see Figs 2b, 3 and Methods), with values ranging from 5.2 1s to 8.4 js, 
whereas the values are substantially shorter away from the minima. In 
fact, the T> values exhibit sharp divergences right at Buin (Fig. 3). The 
key to understanding this behaviour is the quadratic field dependence 
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Figure 2 | ESE-detected spectra for a dilute sample. a, Variable frequency 
measurements at 5.0 K for an x =0.001 crystal, with 0 = 29°; the 
frequencies are indicated in GHz above each trace. b, Frequency versus 
field plot of the resonances in a. The data are in good agreement with 
simulations (solid curves) based on the ‘axial + B{O?’ parameterization”. 
Selected T> values (in 1s) determined from the measurements in Fig. 3 are 
indicated in blue close to some of the data points. Vertical error bars in b 
denote pulse excitation bandwidths (+1/27,,2, where 7,/2 is the duration of 
the 1/2 pulse), while horizontal error bars represent standard deviations 


(+s.d.) deduced from Gaussian fits to the resonances in a. 


of the EPR transition frequencies (f) close to the gap minima (see 
Methods) 
"8 


2A (Boz = Brin)? (2) 


f=At 


such that the derivative df/dBoz « (Boz — Bmin) 0 as Boz > Bmin; 
here, 7, is the z component of the gyromagnetic tensor. Although 
not explicitly included in equation (1), nearly all sources of dipo- 
lar decoherence (due, for example, to dynamics associated with the 
nuclear bath and collective electron spin excitations, or magnons) can 
be approximated as a time-dependent magnetic noise, 6Bo(t), acting 
on the central spin qubit (the spin being measured) via the Zeeman 
interaction. In other words, processes that flip nearby spins cause 
variations in the local field, 6Bo, at the position of the central spin, 
thereby altering its frequency/phase. Many of these processes involve 
indirect pairwise spin flip-flops (spin diffusion) that are extremely 
hard to mitigate, and persist to very low temperatures. The extreme 
axial anisotropy of HoW po results in an insensitivity to the perpen- 
dicular applied field component, Bo, (see Methods). Meanwhile, 
sensitivity to 6Bo,(t) vanishes (to first order) as Bp, > Bin and 
df/dBo,— 0, resulting in a vanishing contribution to the dipolar deco- 
herence. This is the concept behind ‘atomic clock transitions. Named 
after the principle which gives atomic clocks their exceptional phase 
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Figure 3 | T> divergence at the CTs. Field-swept T, measurements 
recorded at 5.0 K for a small x = 0.001 crystal at @ = 22° and various 
frequencies indicated in the key in the rightmost panel. The first four 
panels illustrate the divergences in T, at the CTs, referenced to the 
left-hand ordinate; the data are plotted in an expanded view as a function 
of (Boz — Bmin), with best-fit Bmin values given in each panel. The rightmost 
panel, meanwhile, displays T2 values well away from the CTs (see Fig. 1b), 
referenced to the right-hand ordinate. Error bars denote the standard 
error in T). 


stability, these transitions are protected against environmental noise, 
resulting in dramatically enhanced quantum coherence**””. Indeed, 
one expects the dephasing time, T>, to scale as (Bp; — Bin)" (n > 0, see 
Extended Data Fig. 1)”*”°, thus explaining the observed divergences 
at the clock transitions (CTs). For comparison, T, measurements are 
displayed in the rightmost panel of Fig. 3 for several ‘normal’ EPR 
transitions, that is, mj=—4 to +4 transitions away from the gap min- 
ima, where the frequency dependence approaches the linear regime 
and df/dBo,— z= 139.9 GHzT“! (Fig. 1). Although T> is moderately 
peaked at the centres of these resonances, the sharp divergences seen 
at the CTs are clearly absent (see Methods for further discussion). 

ESE-detected measurements for an x =0.01 sample reveal diver- 
gences in T) at the CTs that are essentially identical to those seen in 
Fig. 3, with maximum values ranging from 41s to 81s (see Extended 
Data Figs 1 and 2). However, T> values associated with ‘normal’ EPR 
transitions well away from the CTs are much shorter (~100 ns, not 
shown). Because the collection of ESE spectra requires the detection 
of an echo, the observation of these ‘normal’ EPR transitions is chal- 
lenging for x > 0.01. These findings are consistent with the idea that 
dipolar ‘noise’ increases with increasing Ho concentration, resulting 
in shorter Ts for the ‘normal’ EPR transitions, yet there is an apparent 
insensitivity of T> to the Ho concentration right at the CTs. 

Figure 4 displays 5 K ESE-detected spectra for a concentrated x=0.1 
sample that are in stark contrast to those in Fig. 2: narrow resonances 
are observed at the CTs that do not shift at all with frequency, that is, 
the data do not follow the simulations even though CW measurements 
indicate no measurable variation in the spin Hamiltonian parameters 
with Ho concentration”. The total suppression of ‘normal’ EPR tran- 
sitions is attributed to a further reduction of T, on increasing the Ho 
concentration, to the extent that an echo can no longer be detected. 
Nevertheless, the T, values at the CTs remain long (~0.7 1s), resulting 
in the narrow ESE-detected resonances. Indeed, because the echo 
intensity is T,-weighted, the resonance lineshape is a direct manifes- 
tation of the field dependence of T> at Byin. Analysis of CW EPR spec- 
tra suggests that the main contribution to the linewidth is a Gaussian 
distribution in the Bi parameter (p44 = 0.63 MHz). This causes sig- 
nificant vertical broadening of the tunnelling gap, A, and EPR transi- 
tion frequencies, as illustrated in Fig. 1b, which includes contours at 
the oy and +2¢, levels (04 = 123 MHz is the standard deviation in 
A; see Methods). These simulations indicate measurable intensity at 
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Figure 4 | ESE-detected spectra for a concentrated sample. a, Variable 
frequency measurements at 5.0 K for an x =0.10 crystal, with @ = 20°; the 
frequencies are indicated in GHz above each trace. The ESE resonances 
are attributed to CTs. a.u., arbitrary units. b, Frequency versus field plot of 
the CTs in a. Optimum T> values (in 1s) are indicated next to the 9.11 GHz 
data. Meanwhile, the curves correspond to predictions based on the CW 
EPR parameterization’. c, Field-swept T, measurements recorded at 5.0K 
for a separate x =0.10 crystal at 6 = 25° and frequencies of 9.12 GHz (blue 
squares) and 9.20 GHz (red circles). Vertical error bars in b denote pulse 
excitation bandwidths (+1/27,/2, where 7/2 is the duration of the 1/2 
pulse), while horizontal error bars represent standard deviations (+s.d.) 
deduced from Gaussian fits to the resonances in a. Error bars in c denote 
the standard error in T>. 


the CTs up to at least 9.4GHz. However, the Bj distribution does not 
shift the CTs appreciably to lower or higher fields, that is, all molecules 
in the distribution have their CTs at essentially the same B,,i, values. 
This explains the observation of narrow CT peaks spanning a wide 
frequency range in the x=0.1 sample (Fig. 4); similar behaviour is also 
discernible at other concentrations (see Extended Data Fig. 3). 

After magnetic ‘noise other sources of decoherence remain. First, 
the CTs do not protect against direct flip-flop processes that involve 
the central spin qubit”*”°. These energy-conserving events involve 
coupling to other spins via the off-diagonal component of the dipolar 
interaction (f Pag a+ J mi 2). The inhomogeneous broadening will pro- 
vide some protection against this source of dephasing, because it 
requires the central spin to be resonant with other spins. Nevertheless, 
direct flip-flops probably explain the shorter Ts at the CTs in the 
x =0.1 sample. However, unlike the aforementioned indirect spin dif- 
fusion processes, direct flip-flops can be controlled at the stage of 
device design through the tuning/detuning of individual CT frequen- 
cies. Second, coupling of the Ho spin to lattice dynamics (phonons) 
via the CF is also likely to provide significant decoherence pathways, 
particularly as the temperature is raised®'®. Indeed, a significant 


© 2016 Macmillan Publishers Limited. All rights reserved 


temperature dependence of T> is found at the CTs (a decrease by a 
factor of more than 2 upon heating the sample to 7 K), suggesting that 
T, may become limited by the spin-lattice relaxation time, T; (~20 1s 
at 5K). This is something that will be the subject of future 
investigations. 

The critical result from this study is the demonstration that CTs 
can be employed as a means of enhancing the coherence of molecular 
spin qubits in concentrated samples. Therefore, instead of attempting 
to suppress magnetic noise, which can be impractical at the stage of 
device design, we have shown here that one can fortify the molecular 
spin qubit itself against this noise through the use of CTs. In terms 
of design criteria, the molecule of choice should possess a large tun- 
nelling gap within the ground magnetic doublet that matches the 
working frequency of the EPR cavity. The key to this strategy is the 
chemical design of molecular structures with appropriate CF states. 
In rare-earth complexes with integer spin, this goal translates into 
matching the my components of the ground doublet with the rotational 
order (q) of the main symmetry axis of the molecule (see Methods). 
Although this is not trivial to achieve, the case of HoW po is not an 
isolated example. For example, within rigid polyoxometalate chem- 
istry, the terbium derivative of the [LnP5W300110] 2 series (where 
Ln indicates lanthanide) with pentagonal structure (approximate Cs, 
symmetry) has been characterized as having an my= + 5 ground state 
with an even larger tunnelling gap of ~21 GHz that may be suitable 
for pulsed Q-band EPR®. Tunability across this range (10-100 GHz) 
is desirable and practical for quantum information applications, given 
that it matches the clock rate of state-of-the-art microprocessors. 
Moreover, operation at these CTs requires application of only very 
moderate magnetic fields (<0.2T in the present example). Of course, 
this strategy can and should be combined with other ideas that are 
already being applied with success, such as using rigid lattices with 
a low abundance of nuclear spins!®. Nevertheless, it is remarkable 
that working with CTs offers the unique advantage of allowing long 
coherence times with high concentrations of molecular spin qubits. 
In fact, for other molecular spin qubit candidates, T, values of the 
order of tens of microseconds were only observable in deuterated and 
highly diluted samples of Cr7Ni molecular wheels'® and Cu(mnt), 
(mnt=maleonitriledithiolate) complexes". 
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METHODS 


Experimental details. Pulsed EPR measurements were performed on a commer- 
cial Bruker E680 X-band spectrometer equipped with a cylindrical TE), dielectric 
resonator (model ER 4118 X-MD5) with a centre frequency fy = 9.75 GHz. Single- 
crystals of Nao[Ho,Y -x)(WsO1g)2]-1H2O (x=0.001 to 0.25) were prepared accord- 
ing to the method described in ref. 23. Samples were re-crystallized before study, 
then transferred to the spectrometer directly from the mother liquor and cooled 
rapidly in order to prevent loss of crystallinity due to evaporation of lattice solvent. 
The sample temperature was controlled using an Oxford Instruments CF935 helium 
flow cryostat and ITC503 temperature controller. A strong temperature dependence 
of T, at the CTs required operation of the cryostat at a temperature of 5.0 K in order 
to ensure good thermal stability and sample-to-sample reproducibility. 

For each series of measurements, a single crystal was mounted on a 4 mm 
diameter quartz rod and positioned at the centre of the cylindrical resonator for 
perpendicular mode excitation. The tendency for samples to rapidly lose solvent, 
and the low symmetry PI space group of the HoW 19 compound, made it impossi- 
ble to index and align crystals before mounting. However, the Bruker E680 and ER 
4118X-MD5 dielectric resonator combination allows for in situ sample rotation 
about a single axis. Each crystal was therefore aligned as best as possible on the 
basis of angle-dependent CW EPR measurements performed at 9.75 GHz and 
5.0K. The remaining misalignment, 0, between Bo and z was determined by scal- 
ing the applied field to match the simulations in Fig. 1 (see below). A 0 < 30° cri- 
terion was then applied; crystals not meeting this condition were discarded and a 
new sample selected for study. 

When overcoupled for ESE measurements, the bandwidth of the resonator is 

given by Af=fo/Q* 250 MHz, where the loaded quality factor Q~ 40. This is 
sufficient to allow variable-frequency measurements with reasonable microwave 
B, fields down to a lower limit of ~9.1 GHz. The B, fields were independently 
measured under the same conditions via the Rabi oscillation frequency (2) of 
a spin-1/2 EPR standard (the organic radical bisdiphenylene-2-phenylallyl dis- 
solved in polystyrene); B; values varied from ~4 G at 9.1 GHz, to 9G at 9.75 GHz 
(Qa= 11-25 MHz for s= 1/2). A two-pulse sequence (T/2-rT-T-T-echo, where 
T characterizes the pulse durations and 7 the delay time between pulses) was 
employed for all ESE measurements reported in this work. The values of T, 7 and 
the source power were optimized at each frequency, with the assumption that 
the optimum conditions correspond approximately to the Hahn-echo sequence, 
m/2-T-n-T-echo, where 7 refers to the tipping angle. For T; measurements, T was 
varied and the resultant echo amplitude then fitted to a single exponential decay. 
Pulse sequences. Because the ESE measurements were performed well below the 
centre frequency of the cavity, and owing to the lack of a priori knowledge of the 
matrix elements associated with the observed transitions, pulse sequences were 
adjusted at each frequency by one of two methods: (1) the 1/2 pulse length (7/2) 
and source attenuation were adjusted to maximize the echo intensity relative to 
the spectrometer noise for the ESE-detected spectra in Figs 2a and 4a, thereby 
explaining the variability of the vertical error bars denoting excitation bandwidth 
(defined as 2/T, or 1/T5/2, where T,/2 is the duration of the 7/2 pulse in the Hahn- 
echo sequence); and (2) Rabi oscillation measurements were used to determine 
the optimum 1/2 pulse length for the detailed T, measurements displayed in Figs 
3 and 4c, and Extended Data Figs 1 and 2 (the Rabi pulse sequence was optimized 
via method (1)). On this basis, a Rabi frequency, Qp=98 Mrads~! (15.6 MHz), 
was determined for 0 dB attenuation at the CTs, resulting in a minimum 7/2 pulse 
length of 16ns for the employed spectrometer. This corresponds to an optimum 
dephasing factor Q,,= 820, defined here as Q., = QT», a figure of merit for qubit 
operation. We note, however, that this does not preclude shorter pulses using a 
more powerful microwave source, suggesting the possibility of Q,, values up to 
1.5 x 10° using the modified definition in ref. 9. Interestingly, this value is identical 
to the one reported in ref. 9 for an Feg nanomagnet, in spite of the vastly different 
frequencies employed in the two measurements, primarily because of the much 
longer coherence in the HoWo system. Based on knowledge of the spectrometer 
used for the Feg study, we estimate a Q., = QgT> of just 50 for Fes; of course, the 
same arguments concerning limited source power apply in that case. The HoW jo 
Q,, value compares favourably with other candidate molecular spin qubits using 
both definitions—for example, the optimum Q,, (=(2gT)) varies from ~2,000 for 
the Cr7Ni wheel (ref. 10), up to ~10,000 obtained recently for a Cu"! coordination 
complex'®. However, one should bear in mind that extreme dilution/deuteration 
was employed in these cases. 
The spin Hamiltonian. The energy spectrum associated with the Hund’s 
rule spin-orbit coupled ground state of the Ho** ion, with L=6, S=2 and 
J=|L + S|=8, can be described by the effective Hamiltonian (equation (1) in the 
main text, reproduced here for convenience) 


k 
H= S\ SO BUOL+)-A-I+ pgBo- gy -J — UySyBo-l 
k=2,4,6 q=0 


The double summation describes the CF interaction in terms of extended Stevens 
operators 04 (k=2, 4, 6, and|q| < k), with associated coefficients Bt (refs 31, 32), and 
with Of expressed in terms of the total electronic angular momentum operators 
j and f; (i=x, y, Z). Using this convention, the axial (q=0) coefficients determined 
from magnetic and continuous-wave (CW) EPR measurements are?>4; 
By =0.601 cm, B}=6.96 x 10-3 cm~, and Bt}= —5.10 x 10-> cm™|. This param- 
eterization results in the my= +4 CF states lying lowest in energy (Fig. 1), separated 
from the m;=+5 excited states by ~20cm7! (ref. 23). The second term in equation 
(1) describes the hyperfine coupling between the Ho** electron and I= 7/2 nuclear 
spin, resulting in the observation of eight (2+ 1) well-resolved electro-nuclear 
transitions via high-field CW EPR measurements; here, T denotes the total nuclear 
angular momentum operator, and A the hyperfine coupling tensor, for which the 
parallel component, Aj = 830 + 10 MHz, has been determined from the high-field 
CW EPR spectrum. The final two terms in equation (1) respectively parameter- 
ize the electron and nuclear Zeeman interactions with the local magnetic induc- 
tion, Bo, in terms of a Landé g-tensor (g,) and isotropic nuclear g-factor (gy); fp 
and uy represent the Bohr (electron) and nuclear magneton, respectively. The 
parallel component of the Landé g-tensor, g,= 1.25(1), has been determined from 
CW EPR studies”*. 

In addition to the axial (q=0) CF parameters, CW EPR measurements at 


X-band frequencies can only be accounted for by including a sizeable tetragonal 
1 


BiO4 (Of= s(t +J*)) interaction, with Bi=3.14 x 10-3 cm! (see Fig. laand 
ref. 24 for detailed explanation). It is this term (which is allowed because of a small 
distortion of the HoW jo molecule away from exact D4q symmetry) that generates 
avoided level crossings between m;= +4 states, as seen in Fig. la. In principle, the 
sixth order tetragonal B¢O} interaction is also symmetry allowed. However, O¢ 
contains the commutator [ Ts G i +f*)] and is, thus, indistinguishable from 
O4 within the truncated mj=-+4 ground doublet. Therefore, we employ only 
the B{O} term to capture the effects of the distortion away from exact Dyq symme- 
try. The key point is that Ot connects the my= +4 states in second-order, resulting 
in unusually large (~9 GHz) quantum tunnelling gaps. For Bollz, the frequencies 
of the resultant weakly allowed EPR transitions between these states then follow a 
field-dependence of the form (see Fig. 1b) 


2 
ay 
a + V2(Boz as Bmin)? i] A 4s 3 (Bu _ Bmin)? (3) 


where the approximate quadratic expression applies for fields close to the gap 
minima, Bin. Indeed, because oy represents the only off-diagonal CF interaction 
in equation (1), an almost exact mapping of the first expression of equation (3) 
onto curves generated via exact diagonalization of equation (1) is possible, yielding 
the following parameters: A = 9.18 GHz, y,= 139.9 GHzT™! (=1.25 x 8 x jup/h, 
that is, g,= 1.25), and Bmin= 23.6, 70.9, 118.1 and 165.4 mT. This analysis assumes 
Bollz, while the experiments are typically performed with a small field misalign- 
ment (6 # 0), as noted above. However, due to the extreme uniaxial symmetry of 
the HoWo molecule, the perpendicular component of the effective gyromagnetic 
tensor associated with the m;= +4 doublet, 7, e<0.1GHzT ! (21 err < 0.01), 
resulting in a virtual insensitivity to the perpendicular component of the applied 
field (Bo) over the range explored in this investigation; for comparison, note that 
Yproton © 0.04 GHz T ! For this reason, one can approximate the electronic Zeeman 
term in equation (1) using a scalar interaction of the form, g ju Bock: (where 
Boz= Bocos@). Equation (3) then applies quite generally at the gap minima, pro- 
vided the applied field is rescaled to account for any misalignment. Hence all EPR 
spectra are plotted as a function of the longitudinal applied field component, Bo,. 
Importantly, the derivative df/dBy, — 0 (that is, yz¢— 0) as Boz Bmin, resulting 
in an almost complete insensitivity of the EPR transition frequencies at the gap 
minima to magnetic noise associated with the environment, thus giving rise to the 
strong T> divergences at the CTs. However, the small yet finite 7, ,. (<0.1 GHzT“') 
probably limits T, right at the CTs (within -£0.5 G of Bmin) in these studies due to 
the unavoidable field misalignment. In fact, 7, ,-f— 0 as Bosin# — 0, which may 
explain the longer T> values observed at the lowest field CTs in Fig. 3, and also 
suggests that longer Ts may be achievable in precisely aligned samples. 

T> scaling. The data displayed in Fig. 3 were obtained for a small crystal of the 
most dilute sample (x= 0.001). It is the high quality of this crystal that results in 
the sharp T> peaks at all four CTs (all four Bmin locations). However, it gives weak 
ESE signals, making it challenging to perform a detailed analysis of the scaling of 
T> with Bo,. Careful T,; measurements were therefore repeated for larger samples. 
Unfortunately, the larger crystals are susceptible to twinning that manifests as a 
broadening of spectral peaks and T, divergences, with the effect being most pro- 
nounced at the higher field CTs (see Fig. 2a). However, the first CT at 
Bin = 23.6 mT often remains sharp (see below for explanation). Extended Data 
Fig. 1 displays T, measurements for the x= 0.001 and 0.01 concentrations, plotted 
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against (Boz — Bmin) on both logarithmic (main panels) and linear (insets) scales. 
Similar to the data in Fig. 3, the T, peaks exhibit broad tails, with an apparent kink 
at |Boz — Bmin| 2 mT for the more dilute sample. However, when plotted on a 
log-log scale, the data follow a power law (to within the experimental uncertainty) 
spanning an order of magnitude in (Boz — Bmin) for x=0.001, and almost two 
orders of magnitude for x= 0.01, particularly on the high-field sides of the T> 
peaks. This apparent monotonic behaviour of the form Tx (Boz — Bmin) " sup- 
ports our assertion that the decoherence is dominated by dipolar field fluctuations 
that vanish as df/dBo, — 0. However, the exponent, n, is both sample-dependent 
(n=0.33 and 0.46, respectively, for x= 0.001 and 0.01), and different from previous 
predictions”®”*: n= 1 for indirect flip-flop processes (spin diffusion), and n=2 for 
instantaneous diffusion”. We believe that sample inhomogeneity is responsible 
for these differences in HoW po, thus masking the intrinsic T, dependence on Bo,, 
causing obvious sample-to-sample variability. It is nevertheless interesting that a 
power-law scaling still holds, as opposed, for example, to Gaussian behaviour. This 
clearly merits further theoretical investigation. 

Reduced ESE intensity and faster T, decay curves are part of the reason for the 
increased error bars and apparent broad tails seen in Fig. 3 and Extended Data Fig. 1. 
In addition, ESE-envelope-modulation (ESEEM)”’ is detectable in the decay curves 
recorded in these tails (not shown). However, only one to two heavily damped 
periods of oscillation can be seen, thus adding to the error in T> (not to mention a 
potential systematic error that is not taken into account in our analysis). It is these 
combined factors that likely explain the apparent kink in some of the data at 
|Boz — Bmin| 2 mT, as well as the weak variation in T, across the ‘normal transi- 
tions seen in the right-hand panel of Fig. 3. Interestingly, enough ESEEM periods 
can be detected to confirm that it is due to coupling to protons in the sample. 
Importantly, the ESEEM vanishes at the CTs, providing further strong evidence 
that the Ho** spin becomes decoupled from the surrounding dipolar spin bath as 
both (Boz —Brmin) and df/dBoz — 0. 

Spectral broadening. The EPR spectra of HoW jo are inhomogeneously broad- 
ened”, with the two main contributions originating from (i) crystal twinning and 
(ii) strain in the off-diagonal BY CF parameter. 

(i) Crystals of HoWo form as long thin needles that tend to aggregate into 
aligned bundles. Separating single crystals from these bundles can be challenging, 
particularly given that removal of the samples from their mother liquor for periods 
of more than a few minutes leads to sample degradation. Even after separation, our 
measurements suggest varying degrees of mosaic spread, particularly for the larger 
crystals. Indeed, simulations of high-field CW EPR spectra (where the effects of 
the mosaicity are more pronounced than at X-band) employed a Gaussian orien- 
tational distribution with a full-width at half-maximum (FWHM) of 1°, albeit for 
a small crystal”; the distribution is considerably broader for many of the samples 
employed for ESE measurements. Within the context of equation (2), this mode of 
disorder produces a spread in 7, and the Byin values, resulting in horizontal smear- 
ing of the energy levels in Fig. 1, as opposed to a vertical smearing produced by a 
distribution in Bj (see below). The horizontal smearing becomes more pronounced 
at higher fields, akin to g-strain. Consequently, the EPR spectra often become 
broader with increasing field, as is clearly evident in Fig. 2, and less so in Fig. 4. 

Although subtle, the effects of sample mosaicity are most pronounced at the 
CTs. The horizontal spread in the CTs results in a smearing of the divergence in 
T>. In general, the strongest/narrowest divergences were obtained for the small- 
est crystals, which have the smallest mosaic spread. It is for this reason that the 
data for the most dilute samples in Figs 2 and 3 were obtained for two different 
crystals: the large crystal employed in Fig. 2a did not produce particularly strong 
T> divergences, with maximum values reaching only ~2 |1s. Meanwhile, a smaller 
crystal was employed in Fig. 3: this sample gave very good echoes right at the CTs, 
in spite of its reduced spin count; however, its ESE spectra vanish into the noise 
upon moving appreciably away from the CTs. These trends can be attributed both 
to a T, weighting effect, which amplifies the otherwise weak ESE signals at the CTs 
for the more ordered (longer T>) sample, and to the narrower mosaic distribution 
that further enhances echoes at the CTs. Multiple small samples were studied, and 
optimum T) values at the CTs in the 6-8 j1s range were found in nearly all cases 
for the x= 0.001 and 0.01 samples (see Fig. 3 and Extended Data Figs 1 and 2). 

(ii) Other sources of inhomogeneous broadening include: strains in the spin 
Hamiltonian parameters (Bi, Aand g,), caused by microscopic disorder, and inho- 
mogeneities in By due to electron and nuclear dipolar fields. The latter may be ruled 
out as a major source of broadening at X-band (and 5K) due to weak sample 
magnetization and the lack of any systematic dependence of the EPR linewidth on 
Ho concentration. Meanwhile, the only effect of the diagonal (q=0) CF terms in 
equation (1) is to ensure an isolated m;= +4 doublet ground state with 
z= gy) [p/h = 139.9 GHz T-! J=8and gy= 1.25). Other than that, the low energy 
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spectrum exhibits little or no dependence on BY, BY and Be (ref. 24), and should 
thus be insensitive to strains in these parameters. For related reasons, and because 
of the contracted nature of the 4f shell and strong spin-orbit coupling, the g, and 
A tensors are relatively immune to local strains in the crystal structure (although 
the effective interactions will of course be sensitive to sample alignment due to the 
strong axial character of the CF). This leaves Bi which, indeed, has a profound 
influence on the X-band EPR spectrum, as clearly seen in Fig. 1a and discussed in 
detail in ref. 24: B4 directly sets the scale of the tunnelling gap, A, which is respon- 
sible for the CTs. 

The finite BY parameter arises because of a small deviation of the coordination 
environment around the Ho ion from exact D4q symmetry‘. The superposition 
of disorder onto this weakly distorted structure can then give rise to a relatively 
strong modulation of the local Bi parameter and, hence, to a broad distribution 
for the ensemble. Working under this assumption, we re-simulated CW X-band 
spectra obtained for an x=0.1 sample at a frequency of 9.64 GHz (figure 8 of 
ref, 24), assuming that the main source of broadening is a Gaussian distribution in 
Bi. The best simulation is obtained with a FWHM of 5.0 x 107? cm™|, that is, 
~1.6% of Bj (or a standard deviation, og44=2.1 x 10-5 cm~!). This, in turn, pro- 
duces a vertical distribution in the corresponding tunnelling gap, A. Because O4 
connects the m;= +4 states at the second order of perturbation, the resultant 
standard deviation of the gap distribution is given approximately by 
o aX 2Acpas/By = 4.1 x 10-9 cm! = 123 MHz (FWHM of 290 MHz), where 
A=0.306cm |! =9.18 GHz is the mean gap value (the factor of ‘2’ emerges because 


of the quadratic dependence of A on B}). 

Figure 1b depicts the Gaussian broadening of the EPR transition frequencies as 
a 3D colour map, with contours shown at the +o, and +2c, levels of the distri- 
bution. Because Bj affects only A, this mode of disorder does not shift the magnetic 
fields (Byin) at which the CTs occur for the different molecules in the distribution. 
However, it does distribute them vertically over a relatively wide frequency range 
(approximately +0.25 GHz at the 2c 4 level). This can explain the observation of 
ESE intensity exactly at the CTs over a wide frequency range for the concentrated 
(x=0.1) sample seen in Fig. 4. Because the cavity employed for these investigations 
has a centre frequency at 9.75 GHz, its sensitivity improves upon increasing the 
frequency from 9.1 to 9.4GHz. Meanwhile, the number of Ho** spins in the dis- 
tribution decreases with increasing frequency. These two factors approximately 
offset, explaining the relatively constant ESE intensity and signal-to-noise ratio 
across the studied frequency range. The ESE intensity does peak at 9.2 GHz, above 
which it decays, although not as rapidly as one may expect purely on the basis of 
the gap distribution. This is due to the increasing Bj field of the spectrometer, 
which enables excitation of more spins and hence the generation of stronger 
echoes at higher frequencies. 

This same behaviour is observable at the other concentrations. For example, 
CTs are very clearly observable in between the ‘normal’ EPR transitions over a 
wide frequency range at By, = 165 mT for the x= 0.01 sample, as seen in Extended 
Data Fig. 3. Further evidence can also be found at some of the higher frequencies, 
where inspection of Fig. la reveals crossings between nuclear sub-levels 
(Am;= +1) at fields exactly half way between the Byin values. If the applied field 
is not well aligned to the crystal z axis, these become avoided crossings (with 
<10 MHz gaps), giving rise to new CTs at these higher frequencies. This is a 
subtlety of the perpendicular field component, Bo, , which will be the subject of 
a future publication. The avoided nuclear sub-level crossings do not influence any 
of the conclusions concerning the CTs at the Bj gap minima (A). Nevertheless, 
the higher frequency CTs are observable, particularly at low fields where the 
effects of disorder due to sample mosaicity are less pronounced, and the ‘normal’ 
ESE transitions are quenched due to very short T)s (refs 2, 9). This is the expla- 
nation for the sharp double peaks seen for the x= 0.001 sample at ~50 mT 
between 9.4 and 9.7 GHz in Fig. 2a, as well as the sharp zero-field peaks and some 
of the fine structures seen between Bmin values at higher fields and frequencies. 
On the basis of the 50 mT CTs, one can see that the vertical broadening spans less 
than 400 MHz in this sample, that is, less than +200 MHz from the peak of the 
distribution. In other words, opga4 clearly varies from sample to sample, being 
smaller for the x= 0.001 concentration. This is the reason why intensity due to 
the low-frequency CTs (at Bin) is not discernible in between the broad ‘normal’ 
transitions in the most dilute sample in Fig. 2a. 


31. Rudowicz, C. & Chung, C. Y. The generalization of the extended Stevens 
operators to higher ranks and spins, and a systematic review of the tables of 
the tensor operators and their matrix elements. J. Phys. Condens. Matter 16, 
5825-5847 (2004). 

32. Stoll, S. & Schweiger, A. EasySpin, a comprehensive software package for 
spectral simulation and analysis in EPR. J. Magn. Reson. 178, 42-55 (2006). 
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Extended Data Figure 1 | T> scaling. a, b, Field-swept T, measurements 
for the x=0.001 (a) and x =0.01 (b) concentrations at 5 K; the data are 
plotted as a function of (Bp, — Bin) on both log-log (main panels) and 
linear (insets) scales. The blue lines are power-law fits to the positive 
(Boz — Bmin) data (green points), with the obtained exponents (‘slope’) 
given in the figures. Error bars, standard error in T). 
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Extended Data Figure 2 | T> divergence at the x = 0.01 concentration. 
Shown are field-swept T, measurements recorded at 5.0 K for two separate 
crystals at frequencies of 9.12 GHz (blue squares) and 9.20 GHz (red 
circles). Error bars, standard error in T). 
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Extended Data Figure 3 | ESE-detected spectra for the x= 0.01 
concentration. Variable frequency measurements at 5.0 K, with @ = 30°; 
the frequencies are indicated above each trace. Similar to spectra for the 
x= 0.001 sample, the broad 9.2 GHz CT peak splits into two upon moving 
away from the tunnelling gap minimum (see also Fig. 2). However, weak 
ESE intensity can still be detected at By, = 165 mT at all four frequencies. 
This is due to vertical broadening of the CT, caused by a Gaussian 
distribution in BY. 
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Three-dimensional control of the helical axis of a 
chiral nematic liquid crystal by light 


Zhi-gang Zheng!, Yannian Li!, Hari Krishna Bisoyi!, Ling Wang!, Timothy J. Bunning? & Quan Li! 


Chiral nematic liquid crystals—otherwise referred to as cholesteric 
liquid crystals (CLCs)—are self-organized helical superstructures 
that find practical application in, for example, thermography’, 
reflective displays”, tuneable colour filters** and mirrorless lasing™®. 
Dynamic, remote and three-dimensional control over the helical axis 
of CLCs is desirable, but challenging”*. For example, the orientation 
of the helical axis relative to the substrate can be changed from 
perpendicular to parallel by applying an alternating-current electric 
field’, by changing the anchoring conditions of the substrate, or by 
altering the topography of the substrate’s surface’®-"°; separately, 
in-plane rotation of the helical axis parallel to the substrate can 
be driven by a direct-current field!”-!°. Here we report three- 
dimensional manipulation of the helical axis of a CLC, together with 
inversion of its handedness, achieved solely with a light stimulus. We 
use this technique to carry out light-activated, wide-area, reversible 
two-dimensional beam steering—previously accomplished using 
complex integrated systems”° and optical phased arrays”!. During 
the three-dimensional manipulation by light, the helical axis 
undergoes, in sequence, a reversible transition from perpendicular 
to parallel, followed by in-plane rotation on the substrate surface. 
Such reversible manipulation depends on experimental parameters 
such as cell thickness, surface anchoring condition, and pitch length. 
Because there is no thermal relaxation, the system can be driven 
either forwards or backwards from any light-activated intermediate 
state. We also describe reversible photocontrol between a two- 
dimensional diffraction state, a one-dimensional diffraction state 
and a diffraction ‘off’ state in a bilayer cell. 

According to Bragg’s law, when CLCs are in a planar cell—and 
hence their helices are in a ‘standing helix’ (SH) orientation, per- 
pendicular to the substrate—modulating the helical pitch length 
produces tuneable, selective reflection of circularly polarized light. 
In contrast, CLCs in a homeotropic cell—where the helical axes are 
in a ‘lying helix’ (LH) orientation, parallel to the substrate’s surface, 
but randomly oriented—exhibit a fingerprint optical texture. Such an 
LH arrangement has allowed rotational manipulation of microscale 
objects on the surface of CLC films”. On the other hand, a ‘uniform 
LH arrangement, in which the helical axes are oriented along a sin- 
gle direction, produces an optical texture of uniform periodic stripes 
perpendicular to the helical axis, and an in-plane, periodic modulation 
of the refractive index along the helical axis. Varying the pitch length 
of the uniform LH arrangement can modulate the diffraction angle, 
enabling non-mechanical beam steering and spectrum scanning along 
a one-dimensional line®”?. A wide in-plane rotation angle of the helical 
axis has been produced in a hybrid cell (with one substrate treated for 
vertical alignment, the other for homogeneous alignment) by light 
irradiation", but the helical axis could not be transformed from the 
LH to the SH state. Other work has used independent external stimuli 
to transform standing helices to lying helices, or to achieve in-plane 
rotation of uniform lying helices”"'®. Here, we use light to induce 
both events sequentially—transformation of the SH to the uniform 


LH arrangement, followed by in-plane rotation—enabling three- 
dimensional control over the helical axis. 

We used our synthesized, dithienylcyclopentene-based, axially 
chiral molecular switch—(S,S)-D4 (Fig. la and Supplementary 
Information)—as the dopant (1.2 mol%) in the commercially available 
achiral nematic liquid crystal E7, to fabricate a self-organized, optically 
tuneable CLC. The helical twisting power (HTP) of (S,S)-D4 in E7 has 
previously been determined in a wedge cell”; a texture transition of 
such a CLC confined in a homeotropic cell has also been observed?*, 
although no reversible SH-to-LH transition and in-plane rotation of the 
LH have previously been found. This axially chiral switch shows excell- 
ent fatigue resistance with superior thermal stability in both its ring- 
open and its ring-closed states™*. Upon irradiation with ultraviolet light 
at 310 nm, the ring-open structure is transformed into the ring-closed 
isomer; the helical superstructure changes handedness, from the initial 
right-handed to a left-handed form; and the HTP is enhanced. The 
reverse process occurs upon irradiation with visible light at 550 nm. 

We filled this photoresponsive CLC into a homogeneous planar cell 
(where the two substrates were aligned antiparallel to each other, and 
the cell gap, that is the gap between the top and bottom substrates, was 
3.7 £0.1,m). We used a polarizing optical microscope in transmis- 
sion mode to study the sample (Fig. 1b). Initially, it is in a bright state, 
indicating the expected Grandjean planar texture—standing helices. 
Upon irradiation with ultraviolet light for 5 seconds, the bright state 
transforms into a dark state, corresponding to the unwound nematic 
phase, resulting from the homogeneous alignment of liquid-crystal 
(LC) molecules (parallel to the polarization direction of incident light). 
After 10 seconds of irradiation, the bright state reappears (indicating 
the emergence of standing helices with opposite handedness), followed 
by the appearance of the periodic stripes that indicate a uniform LH 
arrangement, and accompanied by simultaneous in-plane rotation of 
the stripes and pitch contraction until the system reaches the photo- 
stationary state (PSS). In contrast with aforementioned work’~"’, here 
the uniform LH arrangement is formed only by light irradiation. This 
LH structure can be erased and driven reversibly with visible light 
irradiation, as follows: the stripes rotate in the opposite direction; 
the distance between two adjacent stripes increases; the helices align 
perpendicularly, producing a left-handed SH arrangement; this left- 
handed structure unwinds and reorganizes to produce the right-handed 
SH arrangement; and eventually the uniform LH texture of the right- 
handed CLC is regenerated (Fig. 1d). Thus, the direction of the helical 
axis of CLCs can be manipulated in three dimensions solely by light 
(Fig. 1c). Moreover, the CLC system in any stimulated intermediate 
state is stable, without showing thermal relaxation, because of the ther- 
mal stability of the chiral molecular switch in both of its isomeric states. 

The light-induced uniform LH arrangement might be produced in 
two main ways: first, through development of a large oblique or a verti- 
cal alignment of the LC molecules (this would benefit LH formation by 
coupling with the chiral effects); and second, through sufficient surface 
anchoring to maintain the orientation of the stripes in a single direction 
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Figure 1 | Light-induced three-dimensional control over the helical 
axis of a CLC. a, Molecular structure of the photodynamic, switchable, 
chiral molecular switch with thermal stability, (S,S)-D4. b, Ultraviolet 
(UV) irradiation (wavelength 310 nm, intensity 2.2 mW cm’) of the 
chiral switch in a planar cell transforms the helical superstructure from 
its original standing helices (SH), through the unwound homogeneous 
state and the SH arrangement with opposite handedness, to the lying helix 
(LH) arrangement. The LC arrangements are shown schematically below 
the polarizing optical micrographs. The pink arrows denote the direction 
of the helical axis in the cell. c, Illustration of reversible, light-induced, 
three-dimensional control over the direction of the helical axis. After the 
left-handed LH is obtained by UV exposure (i), the sample is triggered by 
visible light (vis; 550 nm)—producing, in sequence, a clockwise in-plane 
rotation (ii); transformation from the left-handed LH to left-handed 

SH organization (iii); unwinding of the left-handed SH to generate a 


(achieved by planar surface anchoring of the cell). To investigate these 
possibilities, we carried out molecular-dynamics simulations of the 
photoresponsive CLC (Supplementary Fig. 4); the results are consist- 
ent with Landau-de Gennes’ elastic theory". Specifically, the results 
indicate an oblique alignment of LC molecules, resulting from the 
coupling of the elastic energy with the molecular interactions between 
the chiral switch and LC molecules during photoisomerization. The 
cell gap-to-pitch ratio (d/P) is another critical factor in LH formation, 
and represents the coupling effects from the surface anchoring and the 
twist elastic energy. We found that the measured value of d/P was very 
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homogeneous alignment (iv); and reappearance of the right-handed SH 
arrangement (v). Further stimulation with visible light causes the 
right-handed standing helices to lie down again (vi), and then to form 
the right-handed lying helices and to rotate clockwise in-plane (vii) when 
the system reaches the visible photostationary state. This whole process 
can then be driven backwards to the original state by UV light irradiation 
(blue arrows). This reversible sequence of events in a continuous process 
establishes the three-dimensional manipulation of the helical axis. 

d, Evolution of the optical texture of the CLC during visible light 
irradiation from the UV photostationary state. The angles between 

the direction of the stripes (white dashed lines) and the horizontal 

line (rubbing direction, black dashed lines) are labelled. The threshold 
state is defined as the state in which stripes begin to appear from the 

SH arrangement. All micrographs were taken under polarizing optical 
microscope with crossed polarizers (P, polarizer; A, analyser). 


close to integer multiples of 0.5, implying that the LH is obtained only 
when an appropriate trade-off is reached between the surface anchoring 
and twist elastic energy. The propensity to form a uniform LH arrange- 
ment decreased as the d/P value increased. 

The direction of the helical axis in the LH arrangement is deter- 
mined by the azimuthal angle of the director of LC molecules in the 
middle layer of the cell”®; changes in this angle lead to the light-induced 
in-plane rotation of the helical axis. After photoisomerization, the 
chiral switch undergoes a dramatic change in its molecular structure 
(Supplementary Fig. 7), which would cause a large change in the LC 
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Figure 2 | Light-controllable two-dimensional beam steering for 
spectrum scanning. a, i-ix, Two-dimensional beam steering, resulting 
from the simultaneous in-plane rotation and pitch modulation of a CLC 
helix driven by UV light (310 nm). Such beam steering can be turned off (v) 
and on (vi) solely by light irradiation. b, Diagram showing light-driven, 
two-dimensional beam steering for spectrum scanning. From left to right, 
the helical pitch is compressed and the helical axis is rotated during UV 
irradiation. The pitch compression leads to an increase in the diffraction 
angle for every wavelength; thus the diffraction pattern shifts away from 
the centre (purple straight arrow), leading to a change in the diffraction 
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Figure 3 | Light-induced diffraction dimensionality transformation of a 
bilayer CLC sample. The UV intensity was 0.4mW cm ~’; the irradiation 
time is shown in the top left of each panel. a-l, Diffraction patterns were 
transformed from two-dimensional (2D; a, b), through one-dimensional 
(1D; c) and the diffraction off state (d), to 1D again (e, f), and finally to 
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angle at the edge of the pattern (from w to ¢). Because of the rotation of 
the helical axis, the diffraction pattern also rotates anticlockwise (with 

an angle of 7). The arrows on the LC cells at the top show the direction 

of the helical axis. The scanning range is defined as 7 x (¢— 7). The 
diffraction pattern can be shortened and rotates clockwise by changing 
the stimulus to visible light (550 nm). The scanning rate can be further 
accelerated by increasing the light intensity, and the beam-steering process 
can be reversibly driven back-and-forth from any intermediate state by 
alternating UV and visible light irradiation (Supplementary Fig. 14). 


yAD) quacyilateral 2D quaqrilateral 


2D (g-l). Green dashed lines, connecting the diffraction spots, reflect 
the deformation of diffraction properties. The interior angle of the 
rhomboidal quadrilateral is modulated by rotation of the lying helices 
within each layer, and the transmitted diffraction angle is modulated by 
changes in pitch length. 
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Figure 4 | Helical arrangements of CLCs in a bilayer cell upon UV 
irradiation. a, Irradiation-time-dependent CLC arrangements in the 
upper and lower layers of the bilayer (UV intensity, 0.4 mW cm~’). 

As the UV light propagates from top to bottom, helical transformations 
occur as shown by the colour key. Combining the arrangements in 

the upper and lower parts leads to different diffraction effects as a 

laser probe impinges from bottom to top. Accordingly, there are nine 
different possible LC arrangements in a bilayer cell, which form different 
diffraction patterns. b, i-ix, The nine different possible arrangements, 
with corresponding time intervals. A one-to-one correspondence 


direction in the middle layer, leading to substantial in-plane rotation 
of the helical axis (Supplementary Fig. 10). However, the rotation of 
the helical axis can be suppressed when the LC direction is strongly 
pinned using an applied electric field (Supplementary Fig. 11). If the 
surface anchoring is too weak to resist the external disturbance, the 
formation of the uniform LH arrangement appears not to be favourable 
(for example, in the case of the larger cell gap shown in Supplementary 
Figs 8 and 9), or the conventional polydomain fingerprint texture of the 
CLC is generated (Supplementary Fig. 5k, 1). Planar anchoring with a 
smaller cell gap seems to be favourable for realizing three-dimensional 
dynamic photocontrol of the CLC helix. Thus, the three-dimensional 
manipulation of the helical axis depends on a delicate interplay among 
cell thickness, surface anchoring, pitch length and external stimuli. 
To investigate potential applications of this light-induced, three- 
dimensional manipulation of the CLC helical axis, we explored 
non-mechanical two-dimensional (in-plane) beam steering (Fig. 2). 
We observed a chromatic dispersion as a collimated white probe light 
impinged on the uniform LH arrangement along the cell normal 
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between the diffraction pattern and the LC arrangement can be 
established; for example, the diffraction patterns shown in Fig. 3a 
(UV 0s) and Fig. 3b (40s) correspond to the arrangement shown in 
panel b, i of this figure (UV 0-60 s). The whole process can be briefly 
described as follows: the 2D diffraction pattern is caused by double 
effects from two adjacent LH layers (i, ix); the 1D diffraction pattern 
results from a bilayer containing an LH layer and either an SH or 


switched off if there is no LH arrangement in the bilayer cell (iv, v). 


(Fig. 2a(i)). Stimulation with ultraviolet light led to a simultaneous 
change in helical pitch and in-plane rotation of the stripes of the LH 
(rotation of the grating vector)—causing the diffraction angle of every 
wavelength to vary, and enabling two-dimensional in-plane beam steer- 
ing, which can potentially be applied in spectrum scanning (Fig. 2b). 
The chromatic dispersion is gradually eliminated by continuous 
irradiation, because of the decreasing diffraction angle of every 
wavelength resulting from elongation of the helical pitch. At a time 
stamp of 20-48 seconds (Fig. 2a(v)) the diffraction has disappeared, 
because the uniform LH arrangement has transformed into either the 
SH structure or the unwound homogeneous alignment. Upon further 
irradiation (to 55 seconds), the LH arrangement with the opposite 
handedness re-forms and rotates, diffraction reappears, and the 
diffraction angle increases gradually owing to compression of the CLC 
pitch, until the sample reaches the PSS (Fig. 2a(ix)). Overall, a wide 
two-dimensional scanning range of about 52° x 8° (defined in Fig. 2b) 
is enabled, which is substantially larger than that of 23° x 3.6° reported 
recently”®. Such wide, non-mechanical beam steering is desirable 
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for free-space optical communication, adaptive-optics systems 
and phased-array radar. 

The manipulation and deformation of a two-dimensional beam- 
spot array is an interesting and challenging task, although metastable 
two-dimensional gratings have been encountered by chance”””*, and an 
electric-field-induced two-dimensional grating in a cholesteric polymer 
system has been reported”’. We achieved a reversible dimensionality 
transformation—from a stable two-dimensional diffraction pattern, via 
a one-dimensional pattern, to a diffraction off-state—by irradiating a 
specially designed, LC bilayer cell containing two thin, stacked LH layers 
(in which the surface directions of the adjacent layers were perpendic- 
ular). Figure 3a shows the initial, two-dimensional diffraction array, 
caused by successive diffraction from two LH layers. When one LH 
arrangement is converted to either an SH or a homogeneous alignment 
through irradiation, the diffraction pattern converts from a two-dimen- 
sional grid (the rhomboidal quadrilateral encircled by green dashed lines 
in Fig. 3), to a one-dimensional line, to a direct transmission pattern 
(indicating the diffraction off-state; Fig. 3d). Further exposure leads 
to handedness inversion and thus to a reappearance of the LH struc- 
tures, yielding first a one-dimensional diffraction pattern, and finally 
the two-dimensional pattern. The deformation of the envelope area and 
of the interior angles of the rhomboidal quadrilateral arises from the 
photomodulation of the CLC pitch and in-plane rotation of the helical 
axis. The time sequence of these changes is due to a progressive fall in the 
intensity of light, occurring because of photo-absorption by the chiral 
switch when light passes through the bilayer cell. 

The conventional one-dimensional diffraction pattern emanates from 
one uniform LH layer of the bilayer cell, whereas the two-dimensional 
diffraction pattern develops as a result of combined diffraction effects 
from two adjacent LH layers (Fig. 4b(i)). Note that the initial two- 
dimensional grating (Fig. 3a) arises from two right-handed LH 
layers, whereas the reappeared two-dimensional diffraction pattern 
(Fig. 3g-1) results from two left-handed uniform LH layers. The 
irradiation-time-dependent CLC arrangements in the upper and lower 
layers are illustrated in Fig. 4a. Figure 4b depicts LC arrangements 
that might yield the diffraction patterns shown in Fig. 3. It is also 
conceivable that the one-dimensional diffraction pattern might be 
switched on or off by changing the incident direction of the probe 
laser—analogous to the effect of a diode on current—which might 
enable new optical devices. 

In conclusion, we have achieved light-induced, three-dimensional 
control of the helical axis of self-organized CLCs, resulting in a reversible 
transformation between an SH and a uniform LH arrangement, with 
control of both the in-plane rotation angle of the helical axis and 
the pitch length. This enables reversible, light-driven, wide-area, 
two-dimensional in-plane beam steering. Moreover, we have accom- 
plished a light-induced reversible transformation between two- 
dimensional and one-dimensional diffraction patterns and a diffrac- 
tion off-state by irradiating a bilayer LC cell. The absence of thermal 
relaxation for this chiral switch enables on-demand, digital control of 
both the beam direction and the dimensionality of the diffraction array, 
starting from any desired state and effected exclusively by light. Our 
work is a step towards the realization of complex, light-activated smart 
systems and dynamic, reconfigurable three-dimensional architectures. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


The dithienylcyclopentene-based axially chiral molecular switch was syn- 
thesized and identified according to the route given in the Supplementary 
Information, Section 1. The chiral switch was mixed with the commercially 
available nematic LC E7 at a concentration of 1.2 mol%. The mixture was 
injected into a planar cell with antiparallel alignment of the substrate surfaces 
at the clearing point by capillary action, slowly cooled to 52.5 °C, and stored in 
the dark for about 3 hours to eliminate convection of liquid crystals owing to 
temperature gradients. 

The sample was irradiated using a wide-spectral-band white source (Asahi 
Spectra LAX-Cute, Japan), fixed with an appropriate narrow-band filter (full width 
at half-maximum = 10 nm). The intensity of the incoming light was measured 
using an optical power meter with a highly sensitive silicon-photodiode detec- 
tor (Thorlabs PM100USB). A fibre-connected white-light source (Ocean Optics 
LS-1) with a spectral range from 330 nm to 830 nm was used to characterize the 
performance of the two-dimensional beam steering for spectrum scanning, while 
a monochromatic laser with a wavelength of 650 nm was collimated and adopted as 
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the probe beam to study the dimensionality transformations of the bilayer sample. 
A spatial filter (Newport 900PH-25) was used to eliminate the stray light. To 
achieve two-dimensional beam steering and the bilayer diffraction device, the 
uniform LH arrangement was first induced by ultraviolet irradiation, then followed 
by visible light irradiation to drive the sample to PSS,is. The texture transforma- 
tion during the light irradiation was confirmed by polarizing optical microscopy 
(Leitz, Laborlux 12 POL), and simultaneously the temperature of the sample was 
maintained by a hot stage (Mettler Toledo FP82HT). The pitch of the sample was 
determined through the conventional Cano wedge method. 

The bilayer cell was fabricated using three optically transparent flexible polymethyl 
methacrylate (PMMA) substrates separated by mylar film spacers of identical 
thickness (41m). The substrates were first coated with an antireflection film to 
eliminate Fresnel reflection and then a polyvinyl alcohol (PVA) film to induce 
planar homogeneous alignment of the liquid-crystal molecules. The rubbing direc- 
tions of the PVA films in the upper and lower layers of the bilayer cell were crossed 
with each other. The structure of the bilayer cell is described in Supplementary 
Information, Section 12. 
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The contribution of China’s emissions to global 


climate forcing 


Bengang Li), Thomas Gasser*“, Philippe Ciais*, Shilong Piao>®, Shu Tao!, Yves Balkanski*, Didier Hauglustaine’, 
Juan-Pablo Boisier*, Zhuo Chen!, Mengtian Huang!, Laurent Zhaoxin Li®, Yue Li!, Hongyan Liu!, Junfeng Liu!, Shushi Peng!, 
Zehao Shen!, Zhenzhong Sun!, Rong Wang”, Tao Wang’, Guodong Yin!, Vi Yin’, Hui Zeng!, Zhenzhong Zeng! & Feng Zhou! 


Knowledge of the contribution that individual countries have made 
to global radiative forcing is important to the implementation of the 
agreement on “common but differentiated responsibilities” reached 
by the United Nations Framework Convention on Climate Change. 
Over the past three decades, China has experienced rapid economic 
development!, accompanied by increased emission of greenhouse 
gases, ozone precursors and aerosols”, but the magnitude of the 
associated radiative forcing has remained unclear. Here we use a 
global coupled biogeochemistry-climate model*” and a chemistry 
and transport model® to quantify China’s present-day contribution 
to global radiative forcing due to well-mixed greenhouse gases, 
short-lived atmospheric climate forcers and land-use-induced 
regional surface albedo changes. We find that China contributes 
10% + 4% of the current global radiative forcing. China’s relative 
contribution to the positive (warming) component of global 
radiative forcing, mainly induced by well-mixed greenhouse gases 
and black carbon aerosols, is 12% + 2%. Its relative contribution to 
the negative (cooling) component is 15% + 6%, dominated by the 
effect of sulfate and nitrate aerosols. China’s strongest contributions 
are 0.16 + 0.02 watts per square metre for CO) from fossil fuel 
burning, 0.13 + 0.05 watts per square metre for CH4, —0.11 +£0.05 
watts per square metre for sulfate aerosols, and 0.09 + 0.06 watts 
per square metre for black carbon aerosols. China’s eventual goal 
of improving air quality will result in changes in radiative forcing 
in the coming years: a reduction of sulfur dioxide emissions would 
drive a faster future warming, unless offset by larger reductions 
of radiative forcing from well-mixed greenhouse gases and black 
carbon. 

We first estimate the relative contributions of China to the various 
present-day radiative forcing (RF) components. These relative con- 
tributions are then combined with the Intergovernmental Panel on 
Climate Change (IPCC)’s best guesses’ of global RFs to deduce China’s 
absolute contributions. Only three known anthropogenic perturba- 
tions of RF are missing from our quantitative assessment: secondary 
organic aerosols, black carbon deposition on snow, and the indirect 
effects of aerosols. 

We apply a new global coupled biogeochemistry—climate model 
to calculate net changes in atmospheric concentration of well-mixed 
greenhouse gases (WMGHGs) by balancing prescribed historical 
anthropogenic emissions against natural removal processes. This 
model of reduced complexity, called OSCAR v2.1, is enabled by the 
more complex Earth system models*> upon which its parameters were 
calibrated. Additionally, the fully fledged LMDz4-INCA3 chemistry 
and transport model’ is used to calculate the evolution of the direct 
RE of short-lived atmospheric climate forcers (SLCFs). Finally, the 
RF resulting from altered surface albedo following historical land-use 


change (LUC) in China is estimated through an observation-based 
surface albedo reconstruction’. 

In all these model integrations, we isolate the marginal contribution 
of Chinese emissions to the simulated RFs using factorial simulations. 
This attribution method has been used in previous studies*?!°; and 
it is further described in Methods. This approach ‘tags’ each radia- 
tively active species in the models, following the cause-to-effect chain 
from the emission to the induced radiative forcing. The principle of 
this causal attribution is in line with the “common but differentiated 
responsibilities” acknowledged by the parties to the United Nations 
Framework Convention on Climate Change (UNFCCC)!!"!”. The 
year 2010 is taken to be the present day, and up until the year 1750 is 
considered to be pre-industrial, according to the IPCC. 

Figure 1 shows the relative and absolute contributions of histori- 
cal Chinese emissions since 1750 to each component of global RF in 
2010. Overall, China contributes 10% +£4% (0.30 +£0.11 W m7? out of 
2.88 + 0.46 W m ~”) of the current net global RF from anthropogenic 
emissions since 1750. This contribution is the sum of two terms with 
opposite signs. China contributes 12% +2% (0.48 £0.09 W m ” out of 
4.13 +£0.40 W m ~”) of the global positive RF from WMGHGs, tropo- 
spheric ozone and black carbon aerosols, and 15% + 6% (—0.18 + 0.06 
out of —1.26+0.24W m ”) of the global negative RF from LUC- 
induced surface albedo changes, stratospheric ozone, the effect of 
ozone precursors on CH, lifetime, and sulfate, nitrate and particulate 
organic matter aerosols. 

In absolute terms, the greatest contribution Chinese emissions 
make to global RF is from CO, produced by fossil fuel burning 
and cement production (0.16 + 0.02 W m ”). This contribution is 
followed by that of CH, from anthropogenic sources, including its 
effects on ozone and water vapour (0.13 +£0.05 W m ”), sulfate aer- 
osols produced by SO, emission (—0.11 + 0.05 W m~”) and black 
carbon aerosol emission (0.09 + 0.06 W m~”). In relative terms, 
China contributes substantially more to the global RF induced by 
aerosols (for example, 28% + 10% of sulfate aerosols, 24% + 8% of 
nitrate aerosols, and 14% + 6% of black carbon aerosols) than to the 
RF induced by WMGHGs. 

It is important to note that the contribution of China to current 
global annual anthropogenic emissions is larger than its contribution 
to radiative forcings (Fig. 1). For WMGHGs that have long atmos- 
pheric lifetimes’ (from a few decades to several centuries), the legacy 
of past emissions from countries that began to emit early (such as 
Europe and the USA) still have a contribution to present-day RF larger 
than that of China, despite China's much higher emissions nowadays. 
In contrast, because SLCFs have short atmospheric lifetimes’ (from 
days to months), it is the spatial distribution of current emissions, 
and the local processes controlling their atmospheric transport and 
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Figure 1 | Attribution of present-day global RF and its components to 
China. a, The global RF components and their uncertainty, as estimated 
by the IPCC’. b, The relative contributions of China to the various 
components of global RF in 2010, with our assessment of uncertainties 
(see Methods). When one component of the RF is driven by only one 
species, China's relative contribution to present-day emissions of that 
species is also shown as an empty bar oulined in the same colour. c, The 


removal, that determine the contribution of Chinese emissions of 
ozone precursors and aerosols to the global RF. 

Figure 2 shows the absolute and relative historical contributions 
of Chinese emissions of WMGHGs. During the last 150 years, these 
contributions were dominated by LUC CO, emission. The absolute 
contribution of China to global RF through its LUC CO, emission 
progressively increased until 1980. After 1980, the LUC CO; contri- 
bution started decreasing, as the effect of large-scale reforestation and 
afforestation programmes in China changed the land-use sector into a 
net sink of CO,'*-’. Despite the sharp rise of fossil fuel emissions since 
the 1960s, it was only around 1992 that the absolute contribution of 
fossil fuel CO2 emissions to the RF from China exceeded that of LUC 
CO, emissions. Since that time, fossil fuel CO emissions have been the 
predominant warming factor. At present, LUC CO remaining in the 
atmosphere from past emission still accounts for one-third of China's 
contribution to global RF induced by CO, against two-thirds for fossil 
fuel CO). This partitioning occurs despite the fact that the ‘instan- 
taneous’ fossil fuel emissions of China are today (in 2010) a source 
of 2.32 £0.21 petagrams of carbon per year (Pg C yr~') (26% +3% 
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absolute contributions of China to the RF components and uncertainties, 
as obtained by combining the values of the two previous panels using a 
Monte Carlo approach (n = 50,000). The “Total columns of a and c 

are obtained through Monte Carlo summation (” = 50,000) of the 
corresponding RF components; the ‘Total’ column of b is then deduced 
through the element-wise ratio of these two Monte Carlo ensembles. 

All uncertainties are one standard deviation. 


of the global total), and LUC is a net sink of -0.11 £0.04 Pg C yr7! 
(-16% + 7% of the global total). The RFs induced by the other 
WMGHGs emitted by China also show an increasing trend since 
1750, but with an acceleration after the 1950s for CH, (the absolute 
contribution of this greenhouse gas to China's RF increased by a factor 
of about three between 1950 and 2010) and another at the beginning 
of the 1990s for halogenated compounds. 

Despite the increasing absolute contribution of China to the 
global RF of all WMGHGs, the relative share of China to this RF has 
remained surprisingly stable (between 8% and 12%) over the last 150 
years (Fig. 2, inset). It decreased slightly until 1900, when industry 
was first introduced to China. It then progressively increased, reach- 
ing a maximum around 1960, mainly owing to high LUC CO) emis- 
sions. From 1960, the relative contribution of China to the global RF 
of WMGHGs decreased slightly for more than 30 years until 2000, 
when it again started increasing. The decreasing relative contribu- 
tion during 1960-1990 is the result of several factors. This period 
started with the “Three-year Natural Disaster’ and the ‘Great Chinese 
Famine’ (1959-1961); this was followed by China's Cultural Revolution 
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Figure 2 | Historical time series of China’s absolute and relative 
contributions to the RF of WMGHGs. The coloured plain lines are the 
best-guess absolute or relative contributions of China and the coloured 
shaded areas are the standard deviations coming from our uncertainty 
assessment (see Methods). China’s relative contributions (inset) are 
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Figure 3 | Spatial distribution of RFs from China-induced SLCFs. 
All-sky RF of the SLCFs induced by China through emission of 
short-lived pollutants and precursors in 2010. These RFs are direct outputs 


given as the percentage of the global RF of the single compound, except for 
the ‘All WMGHGs line. See Supplementary Information section A.1 for 
further discussion on the role of CO, from LUC. Contributions are shown 
only from 1850 for clarity, but pre-industrial is taken as being pre-1750. 
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from the LMDz-INCA model. a, The net RF of all the SLCFs combined. 
b-f, The RFs from black carbon, sulfates, tropospheric ozone, nitrate and 
particulate organic matter, respectively. 
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Figure 4 | Recent decadal trends of China’s contributions to the global 
REF and its components. a, The evolution of China’s relative contributions 
over the last thirty years with time slices in 1980, 2000 and 2010. b, The 
decadal trends of these relative contributions. c, d, As for a and b, but for 
China's absolute contributions. The absolute value of global RF and its 
trend, divided by a factor of ten, is also shown, in black. Uncertainties are 
not shown for the sake of readability. However, solid lines mark statistically 


(1966-1976), during which industrial activities and emissions slowed 
down. From 1982 onward, China's reform and open policies led to the 
current period of rapid economic growth', resulting in increased RF 
of WMGHGzs. However, at the same time as industrial activities were 
developing in the 1980s, changed land-use policy dominated by forest 
planting resulted in LUC emission declining and eventually creating 
a carbon sink!*-!* (see also Supplementary Information section A.1). 
The albedo effect of LUC is discussed in Supplementary Information 
section A.2. 

The global direct RF from China resulting from all SLCFs together 
is estimated to be —0.05+0.09 Wm ~~” in 2010, which corresponds 
to a slight yet not statistically significant cooling effect. However, 
this value is the difference between two large and opposite terms: a 
large negative RF caused by sulfate, nitrate and particulate organic 
matter aerosols, and a large positive RF caused by ozone precursors 
and black carbon aerosols (Fig. 1 and Supplementary Fig. 1). It is 
thus in fact fortuitous that today the cooling and warming effects 
of the different SLCFs emitted by China nearly compensate for one 
another. 

Note that the direct RF of SLCFs at the regional scale over China is 
much more pronounced than at the global scale (Fig. 3). The combined 
RF of SLCFs caused by China is negative over East Asia, and remains 
slightly negative over most of the globe. For China’s SLCF emissions, 
the RF effect of particulate organic matter is mostly local. The effect 
of nitrate aerosols influences the whole of East Asia, while the RF of 
sulfate, black carbon and tropospheric ozone caused by emissions in 
China affects the entire Northern Hemisphere; this is due to the longer 
lifetimes of these species in the atmosphere with respect to chemistry 
and deposition processes, compared to other SLCFs. 
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@ Halogenated 

compounds 
significant trends (P < 0.05) between two periods, and dashed lines mark 
moderately significant trends (P < 0.20). (Dotted lines mark trends that 
are not significant.) Thus, a solid line in a or c shows a significant trend in 
speed of change (first derivative) in China-induced RE, and one in b or d 
shows a significant trend in its acceleration (second derivative). Symbols are 
introduced for clarity only. 


Over the last thirty years, China's relative contribution has remained 
close to 10% of global RF (Fig. 4a). This contribution shows a slight 
positive trend over the period (less than a percentage point per 
decade; Fig. 4b), although it is not statistically significant. The absolute 
contribution of China to global RF has increased over the same period 
(Fig. 4c) with a noticeable—yet again not significant—acceleration 
over the last decade happening while the trend of global RF has 
remained approximately steady (Fig. 4d), thus explaining the slight 
positive trend in China’ relative contribution. As before, these trends 
result from opposite trends caused by warming and cooling species, 
as also shown in Fig. 4. 

Of all the species considered in this study, only the positive trend 
in the relative and absolute contributions of China to the RF of fossil 
fuel CO, emissions is statistically significant (P < 0.05). This species 
also exhibits a moderately significant (P < 0.20) acceleration of its 
absolute contribution to global RE, meaning that the strong increase 
in Chinese emission of fossil fuel CO2 over the past decades can already 
be detected, in terms of RE, by our attribution framework. Despite not 
being statistically significant, the trends we estimate for the RF of other 
species are worth mentioning because they provide key elements in 
assessing the near-future contribution of China to global climate forc- 
ing. We find a positive trend in Chinese relative and absolute contri- 
butions to the RF induced by all WMGHGs except LUC CO; and 
these trends see minor accelerations over the recent past. For SLCFs, 
China’ relative contribution to the global RF of sulfate, nitrate, par- 
ticulate organic matter and black carbon aerosols has also increased 
over the last thirty years. Chinese sulfates show an accelerating con- 
tribution, whereas nitrate shows a decelerating contribution to the net 
cooling effect. China’s particulate organic matter and black carbon 
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contributions increased at a relatively steady pace. Finally, the recently 
decelerating net RF attributed to China’s emission of ozone precursors 
results from two opposite effects: a steadily increasing tropospheric 
ozone burden (which is a warming effect), and an accelerating reduc- 
tion of the CH, lifetime mainly caused by accelerating NO, emissions 
(a cooling effect). 

Given the abovementioned estimates of trends, and the delay in 
reducing emissions caused by the lifetime of existing emitting infra- 
structure’® or by social and political systems'’, and given China’s 
pledges in the global climate negotiations (http://unfccc.int/focus/ 
indc_portal/items/8766.php), it appears unlikely that the Chinese 
contribution to the global RF through greenhouse gas emissions 
will decrease in the coming years. This alone will increase China's 
contribution to global RF in the coming years. 

However, it is possible that China may choose to improve its air 
quality by reducing pollutant emissions. This improvement could 
happen through reduction of sulfur dioxide emissions, which would 
reduce the current cooling effect of sulfate aerosols (see Supplementary 
Information section A.3). Whether this would imply an even faster rise 
of China's contribution to the global RF would ultimately depend on 
the reduction of other (sometimes co-emitted) pollutants such as black 
carbon and ozone precursors'®!”, 

We found that China’s emissions contribute about 10% to the global 
RF in 2010. Such a percentage is commensurate with the Chinese 
population, size and economy (19% of the world population, 6% 
of global emerged land, and 12% of the global GDP in 2012). Our 
estimates of China’s contribution to global RF through its emissions 
of WMGHGs are consistent with previous results obtained with a 
simpler model!®. However, contributions from CH4 and LUC CO, 
shows high uncertainty, which reveals the need for more accurate 
emission inventories. 

Contributions from all SLCFs also showed high uncertainty, which 
has two causes. First, knowledge of emissions of those species is 
limited, especially of sulfur dioxide over China”®?!. To improve this 
we need better emission inventories, and improved monitoring net- 
works. Second, limited scientific understanding of aerosol-radiation 
interactions” and aerosol-cloud interactions*””* makes quantitative 
RE estimates of aerosols and SLCFs more uncertain, one reason why 
the latter interactions are not quantitatively assessed here. The three 
missing RF components of this study are qualitatively assessed in 
Supplementary Information section A.4. The inclusion of secondary 
organic aerosols and black carbon deposition on snow would slightly 
increase China's relative contribution. Inclusion of the aerosol-cloud 
interactions, however, would noticeably reduce China’s relative 
contribution to the present-day global RF because of an even stronger 
masking effect induced by Chinese aerosol emissions. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

This study’s first goal is to assess China’s relative contribution to present-day 
(that is, 2010) RE Its absolute contribution to present-day RF is then deduced by 
multiplying our estimates of relative contribution by the IPCC estimate of global 
RE’. Uncertainties are assessed through Monte Carlo ensembles (n = 50,000). 

To calculate China's relative contribution to the RF from WMGHGes, we use a 
simple model of global biogeochemical cycles named OSCAR v2.1 (refs 4 and 5). It 
is an update of the version 2.0 (ref. 25) where non-CO gases were added. OSCAR 
calculates the global concentration of CO2, CH4, N2O and halogenated com- 
pounds, such as SFs, NF3, many HFCs, PFCs, CFCs, HCFCs (see Supplementary 
Information) by balancing their historical anthropogenic emissions against the 
(nonlinear) natural removal processes that define the lifetime of each species. 
The representation of these natural processes has been calibrated on more com- 
plex (usually three-dimensional) models. A complete description of the model 
is provided in Supplementary Information section C. We used several data sets 
for the historical emission of WMGHGs, some of which have a specific focus 
on China'*?°?7, and we combined the corresponding simulations by OSCAR to 
obtain our best guess of China’s relative contribution (see also Supplementary 
Information section B.1). More details on the performance of OSCAR and 
the experimental set-up can be found in Supplementary Information sections 
B.2 and B.3. 

To calculate China’s relative contribution to the RF from SLCFs, we use a 
three-dimensional atmospheric-chemistry aerosol model, LMDz4-INCA3 (ref. 6). 
The RFs of aerosols and ozone are large over the emission region, but decay rapidly 
away from the region of emission. Hence, it is important that the modelled emis- 
sions, transformations and atmospheric removal processes of aerosols and ozone 
are spatially explicit. The model used in this study is 3.75° in longitude and 1.9° 
in latitude with 39 levels. A detailed model description can be found in ref. 6. The 
model is used to estimate China's relative contribution to RF in 1980, 2000 and 
2010. Note that for this latter year (2010) we use the most up-to-date emission data 
set that we know of”*. Performance of the model in reproducing observed surface 
concentration of aerosols is shown in Supplementary Fig. 7 and Supplementary 
Information section B.4.2. No significant bias is found over China or at the global 
scale. More details regarding the performance of the model and the data sets used 
are given in Supplementary Information sections B.4 and B.5. 

To calculate China's relative contribution to the LUC-induced albedo change and 
associated RF, we use a diagnostic method that combines present-day satellite data 
of surface albedo and snow cover with historical land-cover maps*. The albedo- 
induced RF is estimated as a net radiation anomaly at the top of the atmosphere, 
based on a simple (zero-dimensional) transfer model of shortwave radiation (see 
Supplementary Information section B.6.1). 

To isolate China’s relative contribution to the global RF in all those simula- 
tions, we use an attribution and linearization method consistent with the Brazilian 
Proposal (http://www.match-info.net/) and advised by the UNFCCC". The 
method is called ‘normalized marginal and it attributes to China a relative con- 
tribution to RF proportional to the marginal effect of its own emission caus- 
ing the RE To calculate China's relative contribution to the global RE, for each 
WMGHG or SLCEF and the LUC albedo, we performed three simulations: one 
‘normal with all emissions included in the simulation (‘all’); one with Chinese 
emissions reduced by a fraction € (‘“—€china ); and one with all except the Chinese 
emissions reduced by the same fraction (‘“—Egog; for rest of the globe). We deduce 
China’s relative contribution a following the normalized marginal method: 
a= [RF(all) — RF(—€china)]/[2 x RF(all) — RF(—€china) — RF(—€roc)]. Because 
of model specificities and species properties, different ¢ values are used: 0.1% with 
OSCAR, 20% with LMDz-INCA and 100% with the diagnostic method for LUC 
albedo (corresponding to the ‘normalized residual method). However, several 
studies have found only small differences in estimated contributions when using 
different ¢ values’®"!. 


Uncertainties are assessed in terms of the relative contributions of China, which 

implies that uncertainties in China's absolute contributions are obtained using Monte 
Carlo ensembles (1 = 50,000) in which we multiply the relative contributions by the 
global estimates of RF by the IPCC (see Supplementary Information section B.7). 
We assume that the relative uncertainty in China’ relative contribution to one of 
the components of the global RF is the product of two types of uncertainty. The first 
is the relative uncertainty in China's share of global emissions (see Supplementary 
Information section B.3.2 for WMGHGs and Supplementary Information section 
B.5.2 for SLCFs). It dominates the overall uncertainty. The second is the relative 
uncertainty in the radiative efficiency of China’s emission when compared to glob- 
ally averaged radiative efficiency (see Supplementary Information section B.3.3 for 
WMGHGs and Supplementary Information section B.5.3 for SLCFs). For WMGHGs, 
the difference in radiative efficiency comes from the difference in temporality of 
emission and the nonlinearity of the system; for SLCFs, it comes from the difference 
between local and globally averaged processes. For most species, the first type of 
uncertainty is assessed on the basis of the comparison of multiple data sets”, and the 
second type is on the basis of the comparison of multiple models”*"°. For some spe- 
cies, however, we made assumptions. Details of the uncertainty analysis are given in 
Supplementary Information section B.3 for WMGHGs, Supplementary Information 
section B.5 for SLCFs, Supplementary Information section B.6.2 for LUC albedo, and 
Supplementary Information section B.7 for the overall analysis. 
Data availability. Most of the input data used in this study are freely available 
online: CDIAC (http://cdiac.ornl.gov/trends/emis/meth_reg.html); EDGAR 
(http://edgar.jrc.ec.europa.eu/overview.php?v=42); EDGAR-FT (http://edgar. 
jrc.ec.europa.eu/overview.php?v=42FT2010); EDGAR-HTAP (http://edgar.jre. 
ec.europa.eu/htap_v2/index.php); ACCMIP (http://tntcat.iiasa.ac.at:8787/RcpDb/ 
dsd?Action=htmlpage&page=download; file “Historical emissions data (1850- 
2000)’); EPA (http://www3.epa.gov/climatechange/EPAactivities/economics/ 
nonco2projections.html); EDGAR-HYDE (http://themasites.pbl.nl/tridion/en/ 
themasites/edgar/emission_data/edgar-hyde-100yr/edgar-hyde-1-4.html); PKU-FF 
(http://inventory.pku.edu.cn/download/download.html); PKU-CH,g (for China 
only) (http://dods.extra.cea.fr/work/p24peng/PKU-CH4); RCP ODS emissions 
(http://www.pik-potsdam.de/~mmalte/rcps/); ECLIPSE (http://eccad.sedoo.fr/ 
eccad_extract_interface/JSF/page_login.jsf); EANET observations (http://www. 
eanet.asia/index.html); EPA observations (http://aqsdr1.epa.gov/aqsweb/aqstmp/ 
airdata/download_files.html); EMEP observations (http://ebas.nilu.no/); and Aero- 
Com observations (http://aerocom.met.no/cgi-bin/aerocom/surfobs_annualrs.pl). 
Code availability. All other input data are provided with the source code of 
OSCAR. The source code of LMDz-INCA can be downloaded at http://www- 
Isceinca.cea.fr/. We also provide the source code of OSCAR, used to estimate 
China’s contribution to WMGHGs and to perform the overall integra- 
tion and uncertainty analysis, along with any input data used in this study as 
Supplementary Data. 
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Reversal of ocean acidification enhances net coral 


reef calcification 
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Yana Nebuchina!, Aaron Ninokawa’, Julia Pongratz!°, Katharine L. Ricke’®, Tanya Rivlin’*, Kenneth Schneider!’, 
Marine Sesbotié!, Kathryn Shamberger!"'", Jacob Silverman, Kennedy Wolfe’, Kai Zhu!+ & Ken Caldeira! 


Approximately one-quarter of the anthropogenic carbon dioxide 
released into the atmosphere each year is absorbed by the global 
oceans, causing measurable declines in surface ocean pH, carbonate 
ion concentration ([CO;”~]), and saturation state of carbonate 
minerals (Q)!. This process, referred to as ocean acidification, 
represents a major threat to marine ecosystems, in particular marine 
calcifiers such as oysters, crabs, and corals. Laboratory and field 
studies”? have shown that calcification rates of many organisms 
decrease with declining pH, [CO3?~], and Q. Coral reefs are widely 
regarded as one of the most vulnerable marine ecosystems to ocean 
acidification, in part because the very architecture of the ecosystem 
is reliant on carbonate-secreting organisms’. Acidification-induced 
reductions in calcification are projected to shift coral reefs from 
a state of net accretion to one of net dissolution this century”. 
While retrospective studies show large-scale declines in coral, and 
community, calcification over recent decades® '”, determining the 
contribution of ocean acidification to these changes is difficult, if not 
impossible, owing to the confounding effects of other environmental 
factors such as temperature. Here we quantify the net calcification 
response of a coral reef flat to alkalinity enrichment, and show that, 
when ocean chemistry is restored closer to pre-industrial conditions, 
net community calcification increases. In providing results from 
the first seawater chemistry manipulation experiment of a natural 
coral reef community, we provide evidence that net community 
calcification is depressed compared with values expected for pre- 
industrial conditions, indicating that ocean acidification may 
already be impairing coral reef growth. 

The aragonite saturation state (Qarag) of tropical surface waters has 
decreased from about 4.5 in pre-industrial time’? to approximately 3.8 
by 1995 (ref. 14). In this study, sodium hydroxide (NaOH) was used to 
increase the total alkalinity of seawater flowing over a reef flat, with the 
aim of increasing [CO;7-] and Oarag Closer to values that would have 
been attained under pre-industrial levels of atmospheric CO; partial 
pressure (pco,). We used a dual tracer regression method to estimate 
changes in alkalinity uptake (that is, net community calcification) in 
response to alkalinity addition. This approach uses the change in ratios 
between an active tracer (alkalinity) and a passive tracer (a non-reactive 
dye, Rhodamine WT) to assess the fraction of added alkalinity taken up 
by the reef. Changes in the active tracer (alkalinity) result from mixing, 
dilution, and biological activity (that is, calcification), whereas changes 
in the passive tracer (hereafter referred to as the ‘dye’) are due solely to 
mixing and dilution. By comparing the alkalinity to dye ratios before 
(upstream of the study site) and after (downstream) the water mass 


interacts with the reef, we were able to isolate the change in alkalinity 
that is due to an induced increase in net calcification (Extended Data 
Fig. 1). 

Our study was conducted on One Tree Reef (23° 30’ S, 152° 06’ E), 
a pseudo-atoll in the southern Great Barrier Reef (Fig. 1a). One Tree 
Reef encloses three lagoons, two of which are hydrologically distinct 
(that is, separated by reef walls). At low tide, the water level drops below 
the outer reef crest, and the lagoons are effectively isolated from the 
ocean (Fig. 1c). Because First Lagoon sits approximately 30 cm higher 
than Third Lagoon, gravity-driven, unidirectional flow results from 
First Lagoon, over the reef flat separating the two lagoons, and into 
Third Lagoon. Our study site was situated along a section of the reef 
wall separating First and Third Lagoons. Unidirectional flow across this 
area of the reef flat persists for approximately 60 min following peak low 
tide, enabling an experimental setup depicted in Fig. 1d. This section of 
the reef flat is a well-developed, mixed reef community characterized 
by ~17% live coral (Extended Data Fig. 2). 

Our study was conducted once per day, over 22 days between the 
dates of 16 September 2014 and 10 October 2014. Dates, times, light 
data, and predicted heights of low tides are provided in Extended Data 
Table 1. Before low tide each day, a 15m? tank was deployed in First 
Lagoon, adjacent to the study site. On all 22 days, 4g Rhodamine WT 
were mixed with ambient seawater inside the tank. On 15 of those days 
(hereafter referred to as ‘experiment’ days), 15 mol (600 g) of NaOH 
was also introduced into the tank. The resulting solution was pumped 
onto the reef flat at a constant rate of ~21s~! for 60 min starting at the 
predicted time of low tide. The resulting plume flowed over the reef flat 
as described in the Methods. Following the 60 min pumping period, 
discrete water samples were taken at pre-defined sampling locations 
along the length of two parallel transects that defined the borders 
of the study area (along the upstream and downstream edges of the 
reef flat; Fig. 1d and Extended Data Fig. 3). Samples were analysed 
for total alkalinity, rhodamine, pH, dissolved inorganic carbon, and 
nutrients, as described in the Methods (Supplementary Table 1). On 
7 days, observations were made when dye, but no alkalinity, was added 
(hereafter referred to as ‘control days’), to test whether the dye addition 
had unexpected effects, and to characterize background variability in 
the study area. Mean chemical conditions for control and experimental 
days are provided in Fig. 2 and Extended Data Fig. 4. On experiment 
days, the mean concentration of added alkalinity in the central part 
of the plume (containing 50% of the dye), was 50.2 +2.7 pmolkg™!, 
resulting in an average elevation of Qarag in this part of the plume 
by 0.6 units. Mean temperatures, salinities, nutrient concentrations, 
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Figure 1 | Study site and experimental design. a, Map of Australia and 
aerial photograph of One Tree Reef with the study area denoted by an 
orange square. The map, sourced under Creative Commons CC, is freely 
available for commercial use. Use of the photograph was permitted under 
an Educational license from the University of Sydney. b, c, Cross-sections 


and dissolved oxygen concentrations are provided in Extended Data 
Table 2. 

Plots of the alkalinity and dye concentrations along the upstream and 
downstream transects illustrate the spatial distribution of the plume 
within the study area (Fig. 3a—-d). On control days, when dye but no 
alkalinity was added, these parameters were not correlated, and the 
mean alkalinity—dye slopes did not differ from zero (Fig. 3e). On these 
days, the difference in alkalinity between the upstream and downstream 
transects was due to background reef calcification and is represented by 
the difference in y intercepts. On experiment days, when alkalinity and 
dye were jointly introduced to the study site, these parameters were well 
correlated, resulting in positive, significantly non-zero alkalinity—dye 
slopes both for the upstream and for the downstream transects (Fig. 3f). 
On these days, background reef calcification is represented by the dif- 
ference in y intercepts (same as control days), and the fraction of added 
alkalinity taken up by the reef flat, fuptake. Was calculated as the dif- 
ference between the upstream and downstream alkalinity—dye slopes: 


(1) 


where ryp and rgown are the ratios (slopes) of alkalinity to dye for the 
upstream and downstream transects, respectively, inpmolkg~! ppb~! 
or mmol gl. Ata fixed rate of alkalinity and dye addition, Tup indi- 
cates the amount of added alkalinity entering our study site, while row, 
indicates the amount of added alkalinity leaving our study site. The 
difference in these two values indicates the amount of added alkalinity 
taken up by the reef community and was used to calculate the percent- 
age increase in net calcification according to equations (2)-(4). 

Data from all days were analysed using a multivariate regression 
approach to calculate alkalinity-dye ratios (slopes) and mean back- 
ground alkalinities (y intercepts) of the upstream and downstream 
transects, while simultaneously accounting for natural spatial and tem- 
poral variability (see Supplementary Information and Extended Data 
Figs 5-7). Mean alkalinity—dye slopes are presented in Fig. 4a. Results 
of a mixed-effects model indicate that upstream and downstream 
slopes are significantly different on experiment days but not control 
days, rejecting the null hypothesis that net community calcification 
did not respond to alkalinization (see Supplementary Information). 
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of the reef along the yellow line are shown for high (b) and low 

(c) tides, demonstrating the unidirectional flow from the upper lagoon 
(First Lagoon), over the reef flat study area, and into the lower lagoon (Third 
Lagoon) during low tide. d, Schematic of the study area (to scale) indicating 
the positioning of the transects and sampling locations (blue circles). 


The fractional uptake of added alkalinity was calculated according 
to equation (1) and averaged for all control and experimental days. 
Using this method, we estimate that an average of 17.3% + 2.3% 
(1 s.e.m.) of the experimentally added alkalinity was taken up by the 
reef community. 

The percentage increase in net calcification, AG, resulting from alka- 
linity addition was calculated as: 


AG = G increase /G background 


(2) 


where Gincrease is the additional calcification resulting from alkalinity 
addition in mmol s~!, and Gbackground is the background calcification 
in mmol s_! (that is, the calcification rate without added alkalinity). 
Gincrease ANd Ghackground Were calculated as 


Gincrease = Paye(Tup _ Téown) (3) 


G background = F(Alkup = Alk gown) (4) 


where Pjy. is the pumping rate of the dye in gs’, F is the volumetric 
flow rate in m? s~!, and Alkyp and Alkgown are the mean background 
alkalinities (that is, the y intercepts) of the upstream and downstream 
transects, respectively, in mmol m~? (see Supplementary Information). 
Using these equations, we estimate net community calcification 
increased by an average of 6.9% + 0.9% (Fig. 4b). A one-tailed, unpaired 
t-test indicates that the change in calcification on experiment days was 
significantly greater than control days (ty) = 1.981, P< 0.05). On the 
basis of laboratory and mesocosm studies!°, the mean response of coral 
calcification to a unit change in Qyrag is approximately 15%. Throughout 
the entire study area (inside and outside the plume), Oarag Was elevated 
by an average of 0.4 units, indicating a theoretical increase in coral 
calcification of 6%, which agrees closely with the observed increase of 
6.9%. Caution must be applied, however, when comparing calcification 
relationships derived from coral studies!> to mixed-reef communities 
such as that of our study site. 

The hypothesis that Qaag exerts strong control over coral reef calci- 
fication is supported by laboratory experiments and models!®!7 (but 
see ref. 18); however, isolating this control in a natural setting is com- 
plicated by the multiple drivers of calcification, which are often highly 
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Figure 2 | Chemical conditions for control (N= 7) and experiment 

(N= 15) days (mean + 1 s.e.m.). a, b, Aragonite saturation states (Qarag); 
c, d, pH. Error bars are indicative of day-to-day and hour-to-hour 
variability (not measurement error); estimated measurement errors are 
smaller than line thickness and are provided in the Methods. Experimental 
day standard errors are smaller than control day standard errors primarily 
because of the larger N. 

correlated (for example, production, Qarag, light, tmperature)®'?!?-**, 
Previous attempts to manipulate seawater chemistry in the natural envi- 
ronment were unable to demonstrate a causal relationship between sea- 
water chemistry and reef calcification”®. Further, retrospective studies 
documenting declines in coral reef calcification over the past several 
decades were unable to isolate the influences of various causal factors 
(for example, ocean warming, acidification, water quality, fishing pres- 
sure) owing to the confounding influence of co-varying parameters and 
a lack of reliable long-term carbonate chemistry observations””°. Our 
experimental approach demonstrates the influence of alkalinity (and 
Qarag) On net community calcification in a natural setting by uncoupling 
Qarag from otherwise co-varying confounding environmental factors 
(where ‘uncouple’ is used in the technical sense of ‘lack of correlation). 
We demonstrate that restoring [CO37] and Ouarag Closer to pre-indus- 
trial values enhances net community calcification, providing evidence 
that ocean acidification may have contributed to the documented 
declines in coral reef calcification® ” in the industrial era. 

Notably, ocean acidification is one of many stressors acting on coral 
reef calcification. Simultaneously to decreasing Q,;ag, sea surface tem- 
peratures have warmed by an estimated 0.4—0.8 °C (varying by region) 
since the early 1800s (ref. 27) which is posited to have increased cal- 
cification rates until a recent ‘tipping point”®. Identifying the relative 
contributions of various environmental factors, and how they interact, 
to the documented declines in coral reef calcification is complex yet 
essential to understanding how calcification will probably change in 
the coming decades. Further work, using methods developed here, 
could examine how coral reef response is affected by a variety of 
stressors (in isolation and combination) and duration of exposure, 
and help to assess geographic variability in sensitivity to ongoing ocean 
acidification. 

The Qarag of the tropical oceans is expected to continue declining 
from 3.8 to approximately 3.0 by the middle of the century and 2.3 by 
the end of the century". Deliberate alkalinization has been proposed 
as a geoengineering technique to offset ocean acidification impacts on 
coral reefs and other shallow marine ecosystems”’. Our results indicate 
that this approach could, in principle, help protect coral reefs from 
ocean acidification; however, the technical challenges associated with 
implementation would probably make it infeasible at anything but 
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Figure 3 | Relationships between alkalinity and dye for control (N= 7) 
and experiment (N= 15) days. a, b, Dye concentrations; c, d, alkalinities; 
e, f, alkalinity—dye slopes. e, On control days, when dye, but no NaOH, 
was added to the study site, these parameters are not correlated, and the 
resulting alkalinity—dye slopes are not significantly different from zero. 

f, On experimental days, dye and NaOH were jointly added to the study 
site, and the correlation between these parameters results in a positive, 
significantly non-zero slope. Mean alkalinity—dye slopes for control and 
experiment days are shown in Fig. 4a. 


highly localized scales (for example, protected bays, lagoons). Large- 
scale and long-term protection of marine ecosystems from the threat 
of ocean acidification depends on deep and rapid reductions in anthro- 
pogenic emissions of carbon dioxide*” 
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Figure 4 | Alkalinity-dye slopes and percentage change in net 
calcification for control (N= 7) and experiment (N= 15) days 

(mean + 1 s.e.m.). a, b, The difference between upstream and downstream 
slopes (a) was used to calculate the uptake of added alkalinity (equation 
(1)) and the percentage change in net calcification (b) (equations 

(2)-(4)). The reef community took up an average of 17.3% + 2.3% of the 
added alkalinity, implying a 6.9 + 0.9% increase in net calcification. The 
percentage change in calcification on experiment days was significantly 
greater than control days (one-tailed, unpaired f-test, t29 = 1.981, 

P < 0.05). Results by day are presented in Extended Data Fig. 7. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized. The investigators were not blinded to allocation during 
experiments and outcome assessment. 

Concept. The dual tracer regression method developed here is an extension of 
ref. 31 and may have applications in other areas of research, such as nutrient or 
pollution assessments, uptake of industrial or agricultural waste, etc. The primary 
experimental criteria are that the active and passive tracers are added in a fixed 
ratio and at a fixed rate. The methods described here apply to situations where 
there is a dominant flow direction, dispersion or dilution, and a need to measure 
the effect of a reagent on community flux. 

Experimental setup. Before low tide each day, a 15 m? floating ‘header’ tank was 
partly submerged in First Lagoon, adjacent to the reef flat study site. The tank was 
gravity-fed with ambient seawater from the lagoon, and when necessary, a sub- 
mersible pump was used to completely fill the tank. Two marine grade bilge pumps 
(3,000 gallons per hour, Five Oceans) were secured inside the tank for mixing 
during chemical addition and to deliver the solution to the study site. On 22 days, 
4g Rhodamine WT (20g of a 20% solution, Turner Designs 10-108), dissolved in 
0.51 reverse osmosis (RO) water, was manually added to the tank over the course 
of ~30 min and mixed. On 15 of those days, 600 g (~15 mol) of NaOH, dissolved 
in 1.5] RO water, was also introduced into the tank. The solution inside the tank 
was subsequently mixed for an additional 30-45 min to ensure homogeneity. When 
a strong source of alkalinity is added to seawater, brucite forms as a solid precip- 
itate. However, for pH levels below ~9, brucite dissolves. On the basis of visual 
inspection and associated laboratory experiments, we estimated brucite dissolution 
to occur on the timescale of ~100s. Therefore, a mixing time of 30-45 min was 
sufficient to ensure complete dissolution of brucite; during this time, a handheld 
pH probe (Oakton) was used to manually check that the pH of the tank solution 
did not exceed 9.0. The tank was covered to avoid equilibration with the atmos- 
phere, but given that the tank was emptied over a period of 60 min, it is possible 
that air-sea fluxes were not completely avoided. While acid/base manipulations of 
seawater carbonate chemistry are not directly equivalent to the addition/removal of 
dissolved inorganic carbon (Cr), the differences in carbon speciation between acid/ 
base manipulations and CO) gas manipulations are minor if seawater is not allowed 
to equilibrate with the atmosphere (that is, in closed or continuous-flow systems). 
Further, it is infeasible to remove CO) from large volumes of seawater (>10,000 1) 
under field conditions. Therefore, acid/base manipulations are considered justified 
techniques to alter seawater carbonate chemistry in circumstances where large vol- 
umes of seawater are being manipulated, such as mimicking natural flow on coral 
reefs, particularly if the system is not allowed to equilibrate with the atmosphere™. 

The seawater solution from the tank (control days: seawater + dye; experiment 
days: seawater +_NaOH +dye) was pumped onto the reef flat for a period of 60 min 
starting at the predicted time of low tide (Extended Data Table 1) at a constant rate 
(~21s~'). The solution was introduced to the study site via the two bilge pumps 
that were submerged in the tank, connected to two lengths of (1.5-inch inner 
diameter) vinyl tubing that were secured to a cinder block located ~2 m upstream 
from the centre of the study site. Throughout the addition, the ratio of alkalinity to 
dye being added to the study site was assumed to be constant, and flow within the 
study site was considered to be in steady-state. On 30 September 2014 (a control 
day), dye was added using a peristaltic pump instead of the above configuration; 
this was because low tide occurred at 5:32, and assembly of the tank configuration 
was not possible in low-light conditions. 

Following a pumping period of 60 min, discrete water samples were taken at 
defined sampling locations along the length of two parallel transects that defined 
the borders of the study area: one along the upstream edge (adjacent to First 
Lagoon) and the other along the downstream edge (adjacent to Third Lagoon, see 
Extended Data Fig. 3). The total width of the reef flat in this area is approximately 
35 m, and the upstream and downstream transects were separated by ~25 m. The 
length of each transect was 32 m. Thus, the study area consisted of an approximate 
25m x 32m rectangle (800 m’). The upstream transect consisted of 9 sampling 
stations spanning the width of 32 m, and the downstream transect consisted of 15 
sampling stations spanning the width of 32 m. Sample locations were strategically 
assigned with a higher density near the centre of the study area to accurately char- 
acterize the shape of the resulting alkalinity and dye plume/curve. Spacing of the 
station locations is depicted in Extended Data Fig. 3. 

Following the 60 min pumping period, discrete samples were collected at each 
of the 24 sample locations by pumping reef water into 500 ml borosilicate glass bot- 
tles (Corning, 1500-500 Pyrex glass reagent bottle) using battery-operated liquid 
transfer pumps (Sierra Tools, model JB5684). To minimize chemical variation due 
to minor changes in sampling depth and/or location, precise sample locations were 
marked with plastic discs, nailed to the reef substrate. Samples were collected along 
the upstream and downstream transects simultaneously by five individuals, with 


each person sampling four or five locations. All samples were typically collected 
in less than 3 minutes, and it was assumed that the study site was in steady state 
during this time (that is, all fluxes and flows did not change during the 3-minute 
sampling interval). Samples were immediately returned to the One Tree Island 
Research Station, where they were subsampled and analysed for pH, total alkalinity 
(Ar), dissolved inorganic carbon (Cr), and rhodamine (see ‘Chemical Analyses’ 
section). For three upstream stations (—U16, U0, U16) and three downstream sta- 
tions (—D16, DO, D16), nitrate and ammonia concentrations were also determined. 
See Extended Data Fig. 3 for station locations. 

CTD (conductivity-temperature-depth) devices (YSI models 6600, 6920) were 
deployed at four sampling locations, two upstream (—U16, U0) and two down- 
stream (—D16, DO) for continuous measurements of seawater temperature, salinity, 
depth, and dissolved oxygen concentration. These instruments logged continuously 
at 2-minute intervals over the 22 study days. Discrete water samples (Corning, 
1500-250 Pyrex glass reagent bottle) were collected each day at each of the four 
CTD locations, and the dissolved oxygen concentration was measured using an 
automated potentiometric Winkler titration technique**. These values were used 
to verify CTD measurements. 

Alkalinity-dye slopes, r, and mean background alkalinities, 4, for each day were 
calculated using paired alkalinity and dye measurements that were collected across 
all sampling stations, transects, and days (see Supplementary Information). Over 
a 4-week period, we conducted our experimental protocol 23 times: 8 control 
days and 15 experimental days. One control day was omitted from subsequent 
analyses owing to intense rain that heavily influenced alkalinity measurements, 
resulting in 7 control days and 22 total days. This resulted in a total of 526 paired 
alkalinity and dye measurements that were used in the fitting procedure described 
in the Supplementary Information. Two previous expeditions to One Tree 
Island (September/October 2012 and March 2013) characterized site variability and 
allowed testing of the methods presented here. Preliminary data generated in these 
expeditions indicated that demonstrating statistical significance was dependent on 
maximizing signal (uptake of experimentally added alkalinity) to noise (natural/ 
background uptake of alkalinity). 

On experiment days, the difference between the upstream and downstream 
alkalinity-dye slopes indicates the fraction of experimentally added alkalinity 
that was taken up by the reef (equation (1) of the main text). We analysed the 
difference between slopes using a mixed-effects model in R (see Supplementary 
Information). Comparison of confidence intervals indicates that upstream and 
downstream slopes are significantly different on experiment days but not on con- 
trol days. Shapiro-Wilk W-tests were used to verify the underlying assumptions of 
normality (P > 0.05). The purpose of control days was to demonstrate that signifi- 
cant changes in alkalinity-dye slopes do not occur when NaOH is not added, and to 
characterize natural spatial and temporal variability in the study site. Further, with 
this study methodology, effectively, within experimental days, the part of the study 
site that is not affected by the alkalinity-rich plume serves as additional control for 
the part of the study site that is affected by the plume. 

While reef processes other than calcification can alter seawater alkalinity (for 

example, changes in nutrients, salinity), a previous study showed that changes in 
salinity and nutrients had a negligible effect on changes in alkalinity in coral reefs’. 
Salinity and nutrient data from our study are provided in Extended Data Table 2. 
Code availability. The Mathematica routine used to calculate alkalinity-to-dye 
ratios (slopes) and dye-free mean alkalinity estimates (y intercepts) for each day is 
provided in the Supplementary Information. 
Chemical analyses. Discrete samples were immediately returned to the laboratory 
on One Tree Island where they were analysed for pHyota, total alkalinity (Ar), and 
rhodamine, and subsampled for the later determination of total dissolved inor- 
ganic carbon (Cr), and nutrients (NH4*, NO», and NO3). All measurements and 
calculations were consistent with ‘best practices’ recommendations®. For 99.6% 
of station—-day combinations (24 stations x 22 days = 528 bottles), we successfully 
measured pHyota, At, rhodamine, and Cy, resulting in 526 paired measurements. 

Aragonite saturation state (Qarag), carbonate ion concentration ({CO37~]), and 
Pco, were calculated as a function of Az, pHotal, and in situ salinity and temperature 
using the program CO2SYS$**; dissociation constants for carbonate and boric acid 
were determined as in ref. 37 and as refitted in ref. 38, and the dissociation constant 
for boric acid was determined as in ref. 39. 

Parameters that were measured at a subset of sampling stations (that is, temper- 
ature, salinity, and dissolved oxygen measured at -U16, U0, -D16, DO; nutrients 
measured at -U16, U0, U16, -D16, D0, D16) are presented in Extended Data Table 2. 
Parameters that were measured (or calculated) across all sampling stations are 
presented in Fig. 2 (Qarag and pH) and Extended Data Fig. 4 (CO;-, Pco, and 
Cr). All chemistry data are included in Supplementary Table 1. 

Total alkalinity, Ay. Samples for Ar were pre-filtered using GF/F filters (Whatman) 
and analysed in triplicate using a Metrohm 855 Robotic Titrosampler (Metrohm 
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USA) using certified 0.1 N HCl (Fisher Chemical) diluted to a nominal concen- 
tration of 0.0125 N. Acid was calibrated by analysing certified reference material 
(CRM, batch 138) from A. Dickson's laboratory before each titration session (twice 
daily). Ay by volume (mol 1!) was converted to Ar by mass (jumol kg” !) by apply- 
ing a density correction using in situ salinities and temperatures. For each set of 
triplicate analyses, data points that were >10|1molkg~! away from the median were 
removed from the analysis; these outliers resulted from drop scale variability in 
sample delivery. The resulting mean and standard error were calculated for each 
sample location on each day. Instrumental precision from 55 analyses of CRM 
(batch 138) over the course of the study was <21mol kg! (1 s.d.). Alkalinities 
were normalized to the mean salinity of 35.75; salinity-normalized alkalinities 
were used for subsequent analyses. 

pH. Seawater pHiota was determined using an Ocean Optics spectrophotometer 
with 10 cm path length optical cells and m-cresol purple dye (Sigma Aldrich), 
following the methods of ref. 41. Water samples were kept in a temperature- 
controlled water bath (Thermo Scientific, Precision Microprocessor Controlled 280 
Series) at 25°C before analysis to minimize temperature-induced errors in absorb- 
ance measurements. The temperature of each sample was recorded immediately 
after analysis using a digital thermometer accurate to +0.05°C (VWR, Traceable 
Platinum Ultra-Accurate Digital Thermometer). CO2SYS** was used to calculate 
in situ pH values using in situ salinity and temperature measurements. Average 
precision from triplicate measurements for this system was less than 0.010 units 
(1 s.d.). CRM analyses (TRIS buffer, batch 22, A. Dickson) revealed that the system 
was accurate to within 0.005 pH units. 

Rhodamine WT. Rhodamine WT concentration was measured fluorometrically 
using a Turner 10AU fluorometer and 25 ml cuvettes. A series of eight standards 
was made by mass-diluting a 400 ppb (Parts per 10°) Rhodamine WT standard 
(Turner Designs) to 0, 0.5, 1, 2, 4, 16, 32, and 64 ppb. The standard curve was 
measured at the beginning, middle, and end of each measuring day to check for 
drift. Water samples were kept in a temperature-controlled water bath (Thermo 
Scientific, Precision Microprocessor Controlled 280 Series) at 25°C before anal- 
ysis to minimize temperature-induced errors in fluorescence. The temperature 
of each sample was recorded immediately after analysis using a digital thermom- 
eter accurate to +0.05°C (VWR, Traceable Platinum Ultra-Accurate Digital 
Thermometer). Rhodamine concentrations were temperature-corrected using 
the formula F, = F,exp(k(T; — T,)), where F, and F, are the fluorescence at the 
reference and sample temperatures, T, and T,, and k= 0.026/K, equating to a 2.6% 
correction per kelvin (ref. 40). Temperature corrections were applied before nor- 
malizing values to the standard curve. Dye concentrations were then normalized 
to the mean salinity of 35.75; salinity-normalized concentrations were used for 
subsequent analyses. Instrumental precision from triplicate measurements for this 
system was less than 0.1 ppb. 

Dissolved inorganic carbon (Cy). Cy samples were subsampled into 30 ml glass 
serum bottles (Wheaton, 223743), poisoned with 15 11 saturated HgCl, (0.05% by 
volume to inhibit biological activity"), sealed with rubber stoppers, crimped closed 
with aluminium caps, and transported to Stanford University for analysis. Samples 
were analysed approximately 3 months after sampling. Cy was extracted from sam- 
ples by acidifying them with phosphoric acid (H3POx, 5%) using a custom-built, 
automated acidification and delivery system (D. Mucciarone) using high-grade 
nitrogen as a carrier gas connected to an infrared gas analyser (Licor 7000). All 
samples were analysed in duplicate. The instrument was calibrated daily using 
CRM (Batches 141, 138), provided by A. Dickson. Immediate duplicate analyses 
of samples usually yielded instrumental precision of 1-2\1molkg”'. 

Nutrients. Nutrient samples were subsampled into 15 ml conical centrifuge tubes 
(Falcon). Ammonia samples were immediately frozen, and total ammonium 
concentrations (NH3-or = NH3-+NH,!*) were later determined using a modified 
fluorometric method”. Nitrate samples were preserved with 0.1 ml 1 N HCl, 
closed, shaken, and left in the dark at room temperature (~22°C) until transport 
to Eilat, Israel. Nitrite (NO2!~) was measured using a colorimetric method’, 
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with a Flow Injection Autoanalyzer (Lachat Instruments model QuickChem 
8500). Nitrate (NO3!~) was measured by reducing it to nitrite using a copperized 
cadmium column. Precision of ammonia, nitrite, and nitrate measurements was 
~ 0.05 jxmol 1”. Nitrite and nitrate in this study are reported as total oxidized 
nitrogen (TON = NO,!~ + NO! ). Results are provided in Extended Data 
Table 2. 

Salinity. Following the first 5 days of observations, it became evident that conduc- 
tivity measurements from three of the four CTDs proved unreliable; we believed 
this to be from the formation of oxygen bubbles on the sensors (resulting from 
high productivity on the reef flat). Therefore, starting on day six, discrete water 
samples were taken each day at each of the four CTD locations. Samples were stored 
in an air-conditioned, shaded room until transport to the Australian Institute of 
Marine Science for analysis on a Guildline Portasal Salinometer (model 8410A), 
with a precision of +0.0001 units. Accuracy was verified using CRM (OSIL, [APSO 
Standard Seawater, batch P155). For days without discrete salinity measurements 
(N=5), salinity values were calculated for the upstream transect by developing a 
linear relationship between the salinometer values and the reliable CTD. Salinities 
for the downstream transect were calculated from upstream values by applying 
an offset of 0.08 parts per thousand: this offset represents the mean increase in 
salinity between the upstream and downstream transects as a result of evaporation 
(Extended Data Table 2). 

Benthic community structure. Benthic surveys were conducted to characterize 
the underlying community structure of the study area. Five 25 m transects were 
laid on the reef flat perpendicular to the reef front, spaced approximately 8 m apart. 
Photographs were taken of 0.25 m? quadrats at 5m intervals. Photographs were 
analysed with Coral Point Count software with Excel extensions (CPCe) using 
25 random points per quadrat. The benthos was assigned to one of six categories: 
(1) live coral; (2) macroalgae; (3) turf algae; (4) Halimeda; (5) crustose coralline 
algae (CCA); and (6) sand/rubble. Where morphological forms of CaCO; (for 
example, rubble, CaCO; rock) were covered with biologically active groups (for 
example, turf, CCA), the biologically active group was scored. Results are provided 
in Extended Data Fig. 2. 


31. Friedlander, S. K., Turner, J. R. & Hering, S. V.A new method for estimating dry 
deposition velocities for atmospheric aerosols. J. Aerosol Sci. 17, 240-244 
(1986). 

32. Andersson, A. J. & Mackenzie, F. T. Revisiting four scientific debates in ocean 
acidification research. Biogeosciences 9, 893-905 (2012). 

33. Anderson, L. G., Haraldsson, C. & Lindegren, R. Gran linearization of 
potentiometric Winkler titrations. Mar. Chem. 37, 179-190 (1992). 

34. Kinsey, D. Alkalinity changes and coral reef calcification. Limnol. Oceanogr. 23, 
989-991 (1978). 

35. Riebesell, U., Fabry, V. J., Hansson, L. & Gattuso, J. P. Guide to Best Practices for 
Ocean Acidification Research and Data Reporting (Publications Office of the 
European Union, 2010). 

36. Lewis, E. & Wallace, D. W. R. Program developed for CO2 system calculations 
(US Department of Energy Oak Ridge National Laboratory, 1998). 

37. Mehrbach, C., Culberson, C. H., Hawley, J. E. & Pytkowicz, R. M. Measurement 
of the apparent dissociation constants of carbonic acid in seawater at 
atmospheric pressure. Limnol. Oceanogr. 18, 897-907 (1973). 

38. Dickson, A. G. & Millero, F. J. A comparison of the equilibrium constants for the 
dissociation of carbonic acid in seawater media. Deep-Sea Res. 34, 1733-1743 
(1987). 

39. Dickson, A. G. Thermodynamics of the dissociation of boric acid in synthetic 
seawater from 273.15 to 318.15 K. Deep-Sea Res. 37, 755-766 (1990). 

40. Wilson, J. F. in Techniques for Water Resources Investigations of the U.S. 
Geological Survey, Book 3, Ch. Al2 (U.S. Government Printing Office, 1968). 

41. Dickson, A. G., Sabine, C. L. & Christian, J. R. Guide to Best Practices for Ocean 
CO2 Measurements (North Pacific Marine Science Organization, 2007). 

42. Holmes, R. M., Aminot, A., Kerouel, R., Hooker, B. A. & Peterson, B. J. A simple 
and precise method for measuring ammonium in marine and freshwater 
ecosystems. Can. J. Fish. Aquat. Sci. 56, 1801-1808 (1999). 

43. Grasshoff, K., Kremling, K. & Ehrhardt, M. (eds) Methods of Seawater Analysis 
(Wiley-VCH, 1999). 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


-o- Upstream -- Downstream 
@ HO: 1= Town BB T Rasa 
) ° © 
° ° 
> [Alk] = r,,*[Dye]+4,, > [Alk] = r,,*[Dye]+4,, 
P= he = 
<x 2 <i, rn 
[AIK] = Foun LDVE]+A8 jon zat [AIK] = Fon LDVE]+A8 joyn 


Dye concentration Dye concentration 


Extended Data Figure 1 | Theoretical representations of the null, HO, 
and alternative, H1, hypotheses. a, In HO, the reef does not take up 

added alkalinity; here, the change in alkalinity between the upstream 

and downstream transects would not be systematically related to the dye 
concentration, and the ratio of the alkalinity—dye relationship, r, would not 
be expected to change between the upstream and downstream locations 
(that is, rup = rdown)- b, In H1, reef uptake of added alkalinity occurs; 

here, areas with more alkalinity (and more dye) change at a different rate 
than areas with less alkalinity (and less dye), resulting in a change in the 
alkalinity—dye slope (that is, rup > Taown)- 
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Extended Data Figure 2 | Community composition of the reef flat study 
area. Percentage cover by benthic type is as follows: crustose coralline 
algae (39%), live coral (17%), turf algae (16%), macroalgae (19%), 
sand/rubble (9%), and Halimeda (5%). 
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Extended Data Figure 3 | Schematic of study area showing meter- 
spacing of station locations for the 9 upstream (U) stations and 15 
downstream (D) transects. Numbers indicate the metre-spacing from the 
centre of the study area, denoted as U0 for the upstream transect and DO 
for the downstream transect. The outermost sampling locations for the 
upstream (—U16, U16) and downstream (—D16, D16) transects define the 
four outermost corners of the study area and were strategically positioned 
to lie outside the alkalinity-dye plume, rendering zero dye concentrations 
and added alkalinity. 
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Extended Data Figure 4 | Mean chemical conditions for control (N=7) 
and experiment (N= 15) days. a, b, Carbonate ion concentrations 
([CO3*~]); ¢ d, pco,; e f, dissolved inorganic carbon concentrations (Cr) 
for upstream and downstream transects. Error bars, which represent 
standard errors, are indicative of day-to-day and hour-to-hour variability 
(not measurement error); estimates of measurement error are provided in 
the Methods. Total alkalinity (Ar), dye concentration, aragonite saturation 
state (Qarag), and total pH (pH) are provided in Figs 2 and 3. 
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Extended Data Figure 5 | Comparison of alkalinity values before and after ‘offset-corrections’ used in the multivariate regression analysis. 
a, b, Measured (that is, ‘raw’) alkalinity values. c, d, ‘Offset-corrected’ alkalinity values. Bold lines represent average conditions; dashed lines show results 
by day. See Supplementary Information. 
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Extended Data Figure 6 | Results of the multivariate regression analysis. | and downstream transects. g, h, Mean background alkalinities by day, 


a, b, Unique offsets by station, x,, for the upstream and downstream 4a, for upstream and downstream transects. Error bars represent standard 
transects. c, d, Magnitude of offsets by day, ya, for upstream and errors. See Supplementary Information. 
downstream transects. e, f, Alkalinity-dye ratios by day, rg, for upstream 
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Extended Data Figure 7 | Results of the multivariate regression were 
used to calculate the additional alkalinity uptake (that is, Gincrease) and 


background alkalinity uptake (that is, Gpackgrouna) by day. a, Fraction Information. 
of added alkalinity taken up by the reef by day, given by (1 - (Taown/Tup)> 
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equation (1) of main text). b, Background reef uptake by day, given by 
(4a, up — 4d, down)- Error bars represent standard errors. See Supplementary 
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Extended Data Table 1 | Schedule for control and experiment 
days, including date, time, predicted height of low tide, and mean 


photosynthetically active radiation (PAR) for the 1h study period 


Date 


16 Sept 
17 Sept 
18 Sept 
19 Sept 
20 Sept 
21 Sept 
24 Sept 
25 Sept 
26 Sept 
27 Sept 
28 Sept 
30 Sept 
01 Oct 
02 Oct 
03 Oct 
04 Oct 
05 Oct 
06 Oct 
07 Oct 
08 Oct 
09 Oct 
10 Oct 


Low Tide 


(HHMM) 


0737 
0907 
1026 
1125 
1210 
1249 
1425 
1455 
1528 
1604 
1647 
0532 
0639 
0809 
0940 
1051 
1150 
1240 
1327 
1411 
1455 
1539 


Water Depth 


(m) 


0.88 
0.93 
0.85 
0.72 
0.59 
0.47 
0.29 
0.30 
0.35 
0.44 
0.58 
0.57 
0.75 
0.84 
0.80 
0.65 
0.46 
0.30 
0.19 
0.15 
0.19 
0.29 


PAR 


(umol m? s') 


1146 
1626 
1797 
1800 
1715 
1215 
931 
675 
551 
297 
25 
277 
866 
1459 
1055 
1916 
1636 
1707 
1453 
1179 
787 
513 


Type of Study 


Control 
Experiment 
Experiment 
Experiment 

Control 
Experiment 
Experiment 
Experiment 
Experiment 
Experiment 
Experiment 

Control 

Control 
Experiment 
Experiment 

Control 
Experiment 
Experiment 

Control 
Experiment 
Experiment 


Control 


Tides were provided courtesy of One Tree Research Station. The low-tide time represents 

the time at which pumping onto the reef started; sampling occurred 60 min afterwards. 

Data for photosynthetically active radiation were obtained from the Australian Institute of 
Marine Science weather station at One Tree Island (http://data.aims.gov.au/aimsrtds/station. 
xhtml?station=131). There was no significant difference between mean light levels for control 
(1,166 +217 mol m-2 s~!, mean +s.e.m.) and experiment (1,098 + 150 pmol m~? s~!) days. 
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Extended Data Table 2 | Mean (+1 s.e.m.) values for temperature (7), salinity (S), ammonium (NHy), nitrite and nitrate (NO2 + NO3), and 
dissolved oxygen (DO) during the 22-day study period 


Control Experiment 
Upstream Downstream Upstream Downstream 
T CC) 23.0+0.3 23.540.5 23.34 0.2 23.740.3 
s 35.79 + 0.02 35.84 + 0.02 35.71 + 0.02 35.80 + 0.02 
NH4 (yumol L’) 0.40+ 0.02 0.23 + 0.02 0.37 + 0.01 0.24 + 0.01 
NO2 +NO3 (umol ES) 1.144 0.04 0.80 + 0.02 1.08 + 0.02 0.78+0.01 
DO (mg L") 5.94 0.2 7.1404 6.2+0.1 7.34 0.1 


Note that underlying natural variability (that is, day-to-day, hour-to-hour) contributes to standard errors; measurement errors for each parameter are indicated in the Methods. 
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Vast assembly of vocal marine mammals from 
diverse species on fish spawning ground 


Delin Wang!, Heriberto Garcia!, Wei Huang!, Duong D. Tran!, Ankita D. Jain?, Dong Hoon Yi’, Zheng Gong!”, J. Michael Jech?, 


Olav Rune Godo’, Nicholas C. Makris? & Purnima Ratilal! 


Observing marine mammal (MM) populations continuously 
in time and space over the immense ocean areas they inhabit is 
challenging but essential for gathering an unambiguous record 
of their distribution, as well as understanding their behaviour 
and interaction with prey species'~°. Here we use passive ocean 
acoustic waveguide remote sensing (POAWRS)”* in an important 
North Atlantic feeding ground®”” to instantaneously detect, 
localize and classify MM vocalizations from diverse species over 
an approximately 100,000 km? region. More than eight species 
of vocal MMs are found to spatially converge on fish spawning 
areas containing massive densely populated herring shoals at 
night-time!!-'¢ and diffuse herring distributions during daytime. 
We find the vocal MMs divide the enormous fish prey field into 
species-specific foraging areas with varying degrees of spatial 
overlap, maintained for at least two weeks of the herring spawning 
period. The recorded vocalization rates are diel (24h)-dependent 
for all MM species, with some significantly more vocal at night and 
others more vocal during the day. The four key baleen whale species 
of the region: fin, humpback, blue and minke have vocalization 
rate trends that are highly correlated to trends in fish shoaling 
density and to each other over the diel cycle. These results reveal 
the temporospatial dynamics of combined multi-species MM 
foraging activities in the vicinity of an extensive fish prey field that 
forms a massive ecological hotspot, and would be unattainable with 


conventional methodologies. Understanding MM behaviour and 
distributions is essential for management of marine ecosystems 
and for accessing anthropogenic impacts on these protected marine 
species! 51718, 

Here we present the ecosystem-wide spatial distributions of vocal 
MMs from multiple cetacean species, including both mysticetes and 
odontocetes, acquired simultaneously with that of their fish prey dur- 
ing their feeding season in the Gulf of Maine (GOM), an important 
North Atlantic MM foraging ground®”® (Figs 1 and 2). The GOM is 
among the more diverse, productive, and trophically complex marine 
temperate areas in the world, with Atlantic herring (Clupea harengus) 
comprising a keystone prey species, common in the diets of many 
marine mammals, piscivorous fish and seabirds of the region®. Using 
a large-aperture, densely sampled, coherent hydrophone array with 
orders of magnitude higher array gain’? than previously available in 
acoustic marine mammal sensing, we could detect, localize and clas- 
sify vocalizing MMs from multiple species instantaneously over an 
approximately 100,000 km? region by POAWRS”* without aliasing'® 
in time and space (Fig. 3a, Methods and Extended Data Figs 1-5). 
Simultaneous fish distributions were acquired instantaneously over tens 
of thousands of square kilometre areas by ocean acoustic waveguide 
imaging (OAWRS)’'"!*, combined with conventional fisheries ultra- 
sonic echosounding’» and fish trawl sampling’* to obtain thousands 
of calibrations at statistically significant locations'*. We mapped the 
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Figure 1 | Full diel cycle distributions of MM 
vocalizations and fish. Vocalizing MMs from 
diverse species are convergent on spawning herring 
distributions in autumn 2006 (26 September to 6 
October). Dense Atlantic herring shoals (>0.2 fish 
per m’, red shaded areas) imaged using OAWRS 
system]! and diffuse herring populations 
(approximately 0.053 fish per m’, bounded by 
magenta line) obtained from conventional fish 
finding sonar'>!°. The MM call rate densities in 
units of number of calls per minute per 25 nmi” 
((min)(5 nmi)*) measured by POAWRS have 

peak values « indicated. Detailed shoaling 

herring and MM species vocalization spatial 
distributions can be found in Extended Data Fig. 6 


MM call rate:density 
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T 
-67.5 
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and Supplementary Information section III, 
respectively. The bathymetric data (contours shown 


((min)(6nmi)?)) in grey) are obtained from the US National Centers 
7 for Environmental Information. 
ma | 
0.01 o i 
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lLaboratory for Ocean Acoustics and Ecosystem Sensing, Northeastern University, 360 Huntington Avenue, Boston, Massachusetts 02115, USA. @Laboratory for Undersea Remote Sensing, 
Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, USA. ?Northeast Fisheries Science Center, 166 Water Street, Woods Hole, Massachusetts 
02543, USA. “Institute of Marine Research, Post Office Box 1870, Nordnes, N-5817 Bergen, Norway. 


366 | NATURE | VOL 531 | 17 MARCH 2016 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


a Day 
i i i i i _ i 
== Fin, a = 0.82 fo S960" 3 
1 - 
— Humpback, 0 = 0.29 “ a . 
a _ odgers ~~ 4 9 7 
405-4 Minkey a= 0.43 pais Gearges | i 
— Sei,.a= 0.12 © eu Basin ~S60, 
== Blue, «= 0.18 : 
= = Odontocetes, 7=0.15 » 
== Herring 
XH M. 
2 42 4. 
@ 
ce} 
2 
3 
i 
4154 SS i MM call. rate density,- 
( Georges Bank (number of calls/ 
A r ((min)(Snmi)?)) 
va ° G 
- LNT 0.06 o rl 
A Great-36ut . — Oe 
| innel 4 ~ 0.002 G 
T T T T T T : T 
b Night 
i i i 1 i —| i 
== Fin, o = 0.82 7 200 ~ s 
= Humpback, o = 0.29 a 
gos] Minke, a= 0.48 Georges |. 
= Sei, «= 0.12 Basin 
== Blue, w= 0.18 
== Odontocetes, «= 0.15 
| == Herring i 
Zz 42 so) e ESS) 
ms a" 
se} 
2 
iG 5 
4 7 
Tf a Hohe 
415-4 oes ‘S. ; \ MM call rate density 
id 1 a (number of calls/ 
& 1 a ((min)(Snmi)?)) 
By we a 
wm - 
2" of te 7 0.06 a bel 
aya or ~_S ! 
Bf >, Great South !\ 0.01 & Rie 
ee fnel 0.002 « a 
T T T T T T T 
-69.5 -69 -68.5 -68 -67.5 -67 -66.5 


Longitude (° E) 


Figure 2 | Day and night distributions of MM vocalizations and fish. 

a, b, Daytime (a) and night-time (b) distributions of vocalizing MMs from 
diverse species. Regions with dense herring populations (>0.2 fish per m’, 
red shaded areas) during day and night are shown. The daytime hours are 
between sunrise and sunset (06:00 to 18:00 EDT), while night-time hours 


vocalization temporospatial distributions of five baleen whale species 
(Mysticeti): blue (Balaenoptera musculus), fin (Balaenoptera physalus), 
humpback (Megaptera novaeangliae), sei (Balaenoptera borealis) and 
minke (Balaenoptera acutorostrata), as well as toothed whale species 
(Odontoceti): sperm (Physeter macrocephalus), pilot (Globicephala 
spp.), orca (Orcinus orca) and several other delphinid species. We exam- 
ined whether these vocal MMs had species-specific spatial affinities 
and how they varied across the prey field over the diel cycle. We further 
investigated the extent to which MM vocalization behaviour for each 
species is correlated to fish shoaling density which had recently been 
shown to be a major factor driving humpback vocal behaviour in their 
feeding ground’. 

The overall vocalization rate spatial distributions of the five 
baleen whale species and the toothed whales are found to be highly 
focused on dense herring shoaling areas'*"'* on the northern flank of 


are between sunset and sunrise the next day (18:00 to 06:00 EDT). All 
other contours are similar to Fig. 1. The bathymetric data (contours shown 
in grey) are obtained from the US National Centers for Environmental 
Information. 


Georges Bank, and almost entirely (more than 90% of the calls) con- 
tained within the region of at least diffuse herring aggregations'>'® 
that extends over a roughly 12,000 km? area between Georges and 
Rodgers Basins in the north, Georges Bank in the south, Northeast 
Channel to the east, and Great South Channel to the west (Figs 1 
and 2). The dense herring shoals!*~'* are characterized by 0.2 fish 
per m? at shoal boundaries to over 10 fish per m? at shoal centres, fac- 
tors 4 to 200 times greater than those of the diffuse aggregations!" 
that are characterized by roughly 0.053 fish per m* (Methods, 
Extended Data Fig. 6). The cumulative call-rate distribution for all 
MM species fall off rapidly with increasing range from the dense 
herring shoals (Fig. 3b). A significant majority of MM vocalizations 
from any species, at least 63% corresponding to the e-folding decay 
value of the cumulative call-rate distribution in range, originate in 
areas that either completely overlap with or lie within 3-8 km range 
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Figure 3 | POAWRS MM detection region and cumulative nocturnal 
MM call rate distribution. a, POAWRS 0.5 probability of detection (POD) 
contour for MM vocalizations in the Gulf of Maine. Dotted box is the 
region shown in Figs 1 and 2. b, Cumulative nocturnal MM vocalization 
rate distribution as a function of minimum distance from nocturnal 
herring shoaling areas. The e-folding distances of the cumulative 


of dense herring shoals. These e-folding distances are less than 
the size of the herring shoals and can be traversed by most MMs 
within several minutes to at most an hour, timescales small relative 
to the roughly 12h duration of the herring shoal’s nocturnal existence. 
In contrast, the zooplankton distribution in the area of intense MM 
vocalizations is diffuse, with volumetric densities (715 +550 zoo- 
plankton per m?, Supplementary Information section V) roughly nine 
times smaller than those found in areas where baleen whales have 
been previously observed to be actively feeding*”*' on zooplankton 
(6,500 + 2,000 zooplankton per m*). From an energetics perspective, 
the dense herring populations, with biomass ranging from 5gm~* 
to over 250 gm~°, provide orders of magnitude more efficient prey 
than the diffuse zooplankton of approximately 0.6+0.5gm~° 
biomass. The spatial relationship quantified here between regions 
of dense MM vocalization rate densities and those of dense fish shoal- 
ing densities are consistent with a mechanism by which MMs from 
around the GOM converge on localized dense prey fields that are 
uniquely available during the annual herring spawning season for 
efficient feeding. 

We find the vocal MMs are not uniformly distributed over the prey 
field, but concentrated in species-specific population centres with 
varying degrees of spatial overlap that can vary over the diel cycle 
depending on the species (Figs 1 and 2 and Supplementary Information 
section III). The dominant MM consumers of GOM herring have been 
proposed?’ to be fins and humpbacks, accounting for close to 50% of 
the estimated 150 + 50 kilotons of herring consumed annually by MM 
predators, with the remaining consumed by other MM species, such as 
minkes and various delphinids. MMs are responsible for 50% of total 
herring consumption by non-human predators’. The Atlantic herring 
that spawn on northern Georges Bank comprise roughly 15% to 25% 
of the GOM Georges Bank complex herring stock'®. These images 
reveal the previously uncharted spatial affinities of multiple MM spe- 
cies simultaneously engaged in foraging activities in the vicinity of an 
enormous fish prey field. The ability to observe predator distributions 
with respect to changing prey fields is essential for advancing our 
understanding of ecosystem processes*”??”?, 

The vocalization repertoire of MMs is highly species dependent, 
varying in acoustic frequency content, duration, and repetition rate 
(see Methods for MM species vocal classification). Of the mysticete 
whales, the fins are found to be the most vocal group with the high- 
est measured vocalization rates of roughly 14,000 calls per day, fol- 
lowed by humpbacks with roughly 2,000 calls per day. The measured 
vocalization rates for all MM species are provided in Extended Data 
Table 1. MMs vocalize for a variety of purposes***?-”? that include 
songs associated with mating in mysticete whales, social sounds 
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nocturnal MM vocalization rate distributions range from 0-8 km 
depending on the species. In contrast, the azimuthally-averaged POAWRS 
MM POD e-folding distances from nocturnal herring shoaling areas are 

a factor of 10 to 100 times larger. The bathymetric data (contours shown 
in grey) are obtained from the US National Centers for Environmental 
Information. 


associated with inter- and intragroup interactions, signals for long 
range or night-time acoustic communication, calls for coordi- 
nated movement during feeding or migration, and echolocation 
for prey-finding in odontocetes that may possibly extend to certain 
mysticete whale species (Methods). 

Determining the temporal and spatial variations in MM vocaliza- 
tions is important for understanding their natural behaviours*”? and 
also crucial for providing controls when assessing anthropogenic effects 
on that behaviour’. All five baleen whale species and the toothed whales 
analysed here have vocalization rates that vary over the diel cycle. Four 
of the five baleen whale species, fin, humpback, blue and minke, are 
found to be more vocal at night (from dusk 18:00 Eastern daylight 
time (EDT) through to dawn 06:00 EDT the next day) with night-time 
vocalization rates a factor of between 2 to 10 times that of the daytime 
(Extended Data Table 1). In contrast, the sei whales and collectively, the 
odontocetes, are more vocal in the daytime by a factor of roughly 1.25. 
The vocalization rate spatial distribution is strongly diel-dependent for 
some MM species, but less so for others (Fig. 2a, b). Combined sensing 
with POAWRS and OAWRS’”*!!"!4 enables MM and fish distributions 
to be simultaneously monitored over instantaneous continental-shelf 
scale regions, and presents a significant advantage in areal coverage 
over conventional line-transect visual'®*° and ultrasonic echosounding 
survey techniques respectively. 

We correlated the MM vocalization rate time series with the herring 
shoaling areal population density time series over the diel cycle as a 
function of MM species (Fig. 4, Extended Data Fig. 7 and Extended 
Data Table 1). Four of the five vocal baleen whale species, fin, hump- 
back, minke and blue, are found to have temporal call rate trends that 
are highly correlated to temporal trends in herring shoaling density 
and also to each other. For humpbacks, their night-time vocalizations 
are dominated by downsweep ‘meows, ‘feeding cries’and ‘bow-shaped’ 
calls that are associated with fish-feeding activities, occur ten times 
more frequently at night than in the daytime accounting for the high 
correlation (ryq,fish = 0.87) to nocturnal herring shoaling densities’. 
The high temporal correlation (rymfish = 0.82) obtained here between 
fin vocalization rate and herring shoaling density indicates that the 
factor of 2 increase in the characteristic 20 Hz fin vocalizations at 
night is likely associated with increased fish-feeding activities. This is 
supported by the spatial focusing of the fin vocalization distribution 
on north-central Georges Bank in the location of dense herring shoal- 
ing populations during night-time (Fig. 2). The minke vocalization 
rates are well correlated to herring shoaling densities, both temporally 
(TM,fish = 0.64) and spatially. The minke call rate distribution spatial 
overlap with dense herring shoaling areas increases from 0% in the 
day to 71% at night (Fig. 3b and Extended Data Fig. 8). This implies 
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Figure 4 | Diel MM call rate and herring shoal areal population density 
time series. a-d, Mean call rates for blue (a), fin (b), humpback (c), and 
minke (d) whales are correlated to Atlantic herring shoal mean areal 
population density over the diel cycle. The error bars indicate standard 


that the fivefold increase in minke buzz vocalization sequences 
recorded here, which resemble the odontocete echolocation click 
sequences (compare Extended Data Fig. 1d, e), play an important 
role in minke fish-feeding activity. Here we find the blue vocaliza- 
tion rates have the highest temporal correlation (ryM,fish = 0-91) to 
herring shoaling densities. The blue whale calls we detected are com- 
prised predominantly of short-duration (approximately 2 s) audible 
downsweep (type D”*) calls, which were previously observed for blue 
whales in both the North Atlantic” and Pacific feeding grounds’ and 
regarded as contact calls. The high temporal correlation between blue 
whale call rates and herring densities obtained here suggest that it 
may be possible that blue whales are consuming herring and the type 
D vocalizations are feeding-related, though this fish-feeding behav- 
iour has not been observed for blue whales. (Other possible explana- 
tions are discussed in Methods.) Previous observations of sei whales 
in the GOM found lower call rates at night for their downsweep chirp 
signals attributed to zooplankton-feeding and higher call rates in the 
daytime while engaged in social interactions and less feeding”’. Here 
we find a similar pattern for their vocalization behaviour in the vicin- 
ity of dense herring shoals, which may result from more fish-feeding 
at night than in daytime. The tooth whales collectively have vocaliza- 
tion rates that are temporally uncorrelated to herring shoaling densi- 
ties, perhaps because their fish-feeding strategies differ from those of 
baleen whales. The toothed whales hunt small fish groups and feed 
on individual fish?*?°, whereas baleen whales engulf large quantities, 
hundreds to thousands of fish, all at once** and rely on fish shoaling 
together for efficient feeding. 

The ecosystem-scale predator-prey behaviour documented here is 
expected to be a regular feature of the annual herring spawning season 
on Georges Bank and an important aspect of the life cycles of the vari- 
ous species involved that is likely to be found in other ocean ecosystems 
where many of the same conditions occur. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size. 
The investigators were not blinded to allocation during experiments and outcome 
assessment. 

POAWRS receiver array recordings of MM vocalizations. Acoustic recordings 
were acquired using a 160 hydrophone-element horizontal receiver line-array*! 
towed by a research vessel along designated tracks north of Georges Bank from 
19 September to 6 October 2006 (GOM 2006 experiment)!!"™, coinciding with 
the annual herring spawning period'*!° on the northern flank of Georges 
Bank®?34, Here we use data from all 160 hydrophone elements nested'*7! into 
four subapertures, where each subaperture contains 64 hydrophones for spatially 
and temporally unaliased sensing up to 4 kHz (sampling rate of POAWRS was 
8 kHz)*°. Acoustic pressure-time series measured by sensors across the receiver 
array were converted to two-dimensional (2D) beam-time series by time-domain 
beamforming'®*®, and further converted to spectrograms by temporal Fourier 
transform. MM vocalizations were automatically extracted using a threshold 
detector (>5.6 dB signal-to-noise ratio (SNR)) with a detection rate of roughly 
87% + 5% depending on the species, call characteristics and in-beam ambient 
noise levels. The beamformed spectrograms were subsequently checked by visual 
inspection to improve expected detection accuracy to over 92%. Examples of 
typical vocalization spectrogram from diverse MM species are shown in Extended 
Data Fig. 1. 

The high gain’? of the densely-sampled large aperture coherent POAWRS 

receiver array used here, up to 10 logig n= 18 dB gain where n= 64 hydrophones 
for each sub-aperture, enabled detection of whale vocalizations either two orders of 
magnitude more distant in range or lower in SNR than a single hydrophone which 
has no array gain (Extended Data Fig. 2). The angular resolution of the receiver 
array!” is dependent on the measured bearing, array aperture length and acoustic 
wavelength, tabulated in Table 1 of ref. 13 for select frequencies. 
Characteristics and function of MM vocalizations from diverse species. The 
time-frequency characteristics of each MM vocalization was extracted via pitch 
tracking*”* and applied to classify the vocalization according to species (Extended 
Data Fig. 3 and Extended Data Table 2). A combination of extracted features, 
orthogonalized via principle component analysis (PCA), were used to optimize 
the vocalization species classification employing k-means and Bayesian-based 
Gaussian mixture model clustering approaches". The bearing-time trajectories 
of each closely associated series of vocalizations were also taken into account to 
ensure consistent classification, and to minimize the automatic classification error 
to between 0.5% to 7% depending on the species. 

In the low frequency range from 10 Hz to 100 Hz, the acoustic spectra were 
dominated by fin, blue and sei vocalizations (Extended Data Fig. la—c). The fins 
were identified from their characteristic 20 Hz centre frequency calls*!~“4 that have 
been associated with communication among fin individuals* and also found to be 
uttered by males as breeding displays in their mating grounds“. Given the large 
volume of fin vocalizations measured here in the vicinity of dense shoaling fish 
populations, averaging 14,000 calls per day, these 20 Hz calls can also be associated 
with feeding behaviour, serving as communication signals or for coordination 
among individuals in their foraging ground. The blues were identified from their 
audible downsweeps — type D calls, burps and grunts”°47~°°, previously found 
to be vocalized by both sexes, regarded as contact or social calls** produced by 
individuals at shallow depths*” of 10-40 m. The seis were identified from their 
downsweep calls occurring singly or as doublets with roughly 4 s separation, and 
also sometimes as triplets*»°'*’, hypothesized to be long-range contact calls poten- 
tially enabling coordinated activities such as feeding or breeding”. 

The spectra in the mid frequency range from 100 Hz to 1,000 Hz were domi- 
nated by minke and humpback vocalizations. The minkes were identified from 
their buzzes comprised of a series of high and low frequency click sequences**°°, 
has characteristics similar to the highly repetitive pulse train of odontocetes that 
may be suitable for prey echolocation. Compared to other baleen whales, the 
humpbacks have a fairly extensive vocalization repertoire. Here the humpbacks 
were identified from their songs*°°, as well as non-song calls*”>* with charac- 
teristics provided in ref. 7. Male humpbacks vocalize songs which are patterned 
sequences of calls as breeding displays‘ in their mating ground, and have been 
observed to carry the tunes into their feeding grounds*~*! (Extended Data 
Fig. 2). The non-song vocalizations detected include ‘feeding cries’ similar to those 
observed in Alaskan humpback cooperative group feeding on herring schools”’, 
as well as ‘bow-shaped’ calls and ‘meows’ suited for night time communication’ 
among humpback individuals and coordination during group feeding activities. 

The spectra at frequencies higher than 1 kHz were dominated by odontocete 
vocalizations (Extended Data Fig. le-h). They consist of sperm whale slow and 
usual click, and creak sequences®*®, pilot and killer whale whistles, as well as 
a wide range of repetitive sequences of downsweep chirp signals roughly 0.7s 
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duration with varying bandwidths between 200 to 1,000 Hz, all occurring above 
1 kHz that can be attributed to pilot®® or killer whales®*®, or a variety of other 
delphinid species”*. The highly repetitive click sequences used for prey echoloca- 
tion occur at frequencies higher than 10 kHz for many odontocete species, beyond 
our sampling frequency range. The largest of them, the sperm whale, has slow and 
usual click and creak sequences with significant energy as low as 1 kHz*“’. The 
whistles and wide variety of downsweep chirp signals we recorded in the frequency 
range of odontocete vocalizations may serve as contact calls between individuals 
and to facilitate cohesion during foraging or travel®°>”". The fin, humpback, sei, 
minke, pilot and orca whales, and common and bottlenose dolphin species were 
visually sighted during the experiment. 

Determination of diel MM vocalization rate time series. The bearing-time tra- 
jectories of vocalizations from multiple MM species received by the POAWRS 
receiver array are shown for two days in Extended Data Fig. 4. For humpbacks, the 
alternation from song to non-song calls in their vocalization repertoire over several 
diel cycles are plotted as a function of bearing-time trajectory in Extended Data 
Fig. 5 for comparison. The line-array’s left-right ambiguity is resolved following 
the approach outlined in refs. 7, 71. The bearing ranges from 100° to 240° from 
true north for our array spans Georges Bank from east to west respectively. The 
diel vocalization rate (calls per min) time series shown in Fig. 4 and Extended Data 
Fig. 7 for each MM species is obtained by averaging the daily vocalization rate time 
series for that species over the entire experiment. The MM vocalization rate time 
series, initially calculated in 15 min bins, are averaged over a 1.25h running win- 
dow corresponding to the half power width quantifying the temporal correlation 
scale of the fish shoaling density time series. For both minke and odontocete whale 
click sequences, since the duration of each click sequence is highly variable from 
a few seconds to over a minute, the call rates shown in Fig. 4 and Extended Data 
Fig. 7 represent the number of 5s intervals that contain click sequences. 
Localization and call rate spatial distributions of diverse vocalizing MM 
species. The horizontal location of each MM vocalization consists of a range and a 
bearing estimate. The moving array triangulation (MAT)”! and the array invariant 
(AI)’!-?? methods were applied to determine the range of the vocalizations from 
the horizontal receiver array centre. Position estimation error, or the root mean 
squared (RMS) distance between the actual and estimated location, is a combina- 
tion of range and bearing errors quantified for this array in refs. 7,71 and 73. Range 
estimation error, expressed as the percentage of the range from the source location 
to the horizontal receiver array centre, for the MAT technique is roughly 2% at 
array broadside and gradually increases to 10% at 65° from broadside and 25% at 
90° from broadside, that is, near or at endfire’!. Range estimation error for the AI 
method is roughly 4% to 8% over all azimuthal directions’»”. Bearing estimation 
error of the time domain beamformer is roughly 0.5° at broadside and gradually 
increases to 6.0° at endfire”!. These errors are determined at the same experimental 
site and time period as the MM position estimates presented here, from thousands 
of controlled source signals transmitted by the same source array used to locate 
the herring shoals presented here and are based on absolute global positioning 
system (GPS) ground truth measurements of the source array’s position”’, which 
are accurate to within 3 m to 10m. More than 80% of vocalizing MMs are found 
to be located between 0° to 65° from the broadside direction of the horizontal 
receiver array. Position estimation error””! is less than 2km for majority of the 
vocalizing MMs localized in Figs 1 and 2 since they are found within roughly 50 km 
of the horizontal receiver array centre. This error is over an order of magnitude 
smaller than the spatial scales of the MM concentrations shown in Figs 1 and 2, and 
consequently has negligible influence on the analyses and results. 

The estimated locations for all MM calls over the duration of our data collec- 
tion are used to generate the call rate density map for each MM species shown in 
Figs 1 and 2 following the approach described in ref. 7. The location of each call is 
characterized by a 2D Gaussian probability density function with mean equal to 
the measured mean position from MAT or the AI method and standard deviations 
determined by the measured range and bearing standard deviations. The MM call 
rate density map for each species is determined by superposition of the 2D spatial 
probability densities for the location of each call, normalized by the total meas- 
urement time. (See Supplementary Information section III for more detailed diel, 
diurnal and nocturnal MM spatial vocalization rate distributions). 

Estimating MM detection region for POAWRS receiver array. The probability of 
detection Pp (r) of the MM vocalizations from each species as a function of range 
r from the POAWRS receiver array, shown in Fig. 3 and Extended Data Fig. 8, are 
calculated employing the formulation”!*”4-** and parameters”*”-®? provided in 
Supplementary Information section I. Higher transmission loss occurs in shallower 
waters due to more intense and pervasive bottom interaction®®*°? and mode 
stripping effects, especially at the low frequency vocalization range of large baleen 
whales. Transmission loss in deeper waters is typically significantly lower due 
to upward refraction®*° which leads to far less intense and pervasive bottom 
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interaction, as is the case in the deeper waters north of Georges Bank**"-*?. As 
noted in ref. 7, highly directional transmission loss may then occur when there are 
large depth variations about a receiver. This effect makes the detection range of 
whales in directions to the north of our receiver and Georges Bank typically much 
greater than in directions to its south where the relatively shallow waters of Georges 
Bank are found. The fact that we localized the sources of many whale calls at great 
distances along shallow water propagation paths on Georges Bank in directions 
where transmission loss was greater and found negligibly small vocalization rates in 
the deeper waters north of Georges Bank where transmission loss was much less 
(Fig. 3a), greatly emphasizes the finding that the vocalization rates originating from 
the region further north of Georges Bank were negligibly small. 

The dominant portion of the whale population within the POAWRS detection 

region is expected to occur in areas with dense vocalization rates for each MM 
species. This is because the probability of detecting no vocalizations in a region 
where a MM species is abundant over our two week observation period is neg- 
ligibly small. This implies that there are insignificant numbers of MMs within 
the detection region at locations with low call rates or no calls. This is consistent 
with acoustic-based marine mammal population density estimation, which makes 
the standard assumption that there are no whales expected to be present in a 
time-space region where there are no calls”*-*%, or that this number is negligibly 
small. The MM vocalization rate spatial distribution obtained here showing the 
northern flank of Georges Bank and its immediate vicinity as an autumn season 
MM hotspot is consistent with analysis of three decades (1970-2005) of visual line 
transect survey data for the entire GOM region!°*? (Supplementary Information 
section IV). 
Atlantic herring areal population density distribution and time series. The 
Atlantic herring instantaneous areal population density over wide areas shown in 
Figs 1 and 2 and Extended Data Fig. 6 were obtained from active OAWRS imaging 
after extensive calibration with tens of thousand instantaneous coincident con- 
ventional ultrasonic fisheries echosounding measurements!*", with fish species 
identification and physiological parameters extracted from trawl samples collected 
over the course of the experiment'®. The Atlantic herring shoals were consistently 
observed to form when the population density reached a critical value of 0.2 fish 
per m? where correlated behaviour began, following simple physical theories’”. 
This critical density was also consistently found to be the boundary where the 
dense shoal ended and diffuse populations that did not engage in correlated behav- 
iour began’?"|“, Populations within the dense shoals were variable from 0.2 to over 
10 fish per m2. These shoals extended roughly 20-60 m vertically in water-column 
depths of 80-200 m. In contrast, the diffuse fish populations approximately 0.053 
fish per m? were found close to (within 3-5 m of) the seafloor. The shoals formed 
near sunset and persisted until near sunrise, starting on the northern flank of 
Georges Bank and migrating southward to shallower waters on the bank. This 
diurnal behavioural pattern was consistently observed during our roughly two 
week measurement time period?" 

The annual autumn season Atlantic herring spawning activity on the northern 
flank of Georges Bank has been recorded by the US National Marine Fisheries 
Services (NMFS) for over 30 years, coinciding their survey of the Georges Bank 
herring stock with this period each year™!>'6, including our GOM 2006 experi- 
ment. The overall Georges Bank Atlantic herring stock estimate for autumn 2006 
based on OAWRS'* survey has been found to match well (within 10% to 20%) with 
independent NMFS stock estimates for 2006 and 2007 (ref. 16). 

Midwater trawl hauls were conducted on an ad hoc basis to sample significant 
backscatter observed during the NMFS acoustic survey in autumn 2006 (ref. 15). 
Stomach-content volume and diet composition of up to 15 herring per trawl haul 
(length-stratified subsampling) were recorded at sea. The majority of pre-spawning 
herring were not feeding as evidenced by the high proportion of empty stomachs’. 
During the acoustic/midwater trawl survey on Georges Bank in 2006, 95% of 
herring had nothing in their stomach (that is, stomach-content volume was 0cm*), 
which is comparable to other years where the proportion of empty stomachs ranged 
from 100% in 2003 to 69% in 2010. 

The areal resolution of the fish density distribution shown in Figs 1 and 2 
obtained from the OAWRS imaging system is 30 m in range after matched filter- 
ing and averaging, and varies between 150m to roughly 1.5 km in cross-range for 
the vast majority of fish hotspots included here, where the cross-range resolution 
is dependent on the array angular resolution, bearing, and fish range. The MM 
call rate distributions have spatial resolutions bounded by roughly 2 km that is of 
the same order of magnitude as the fish areal population density distribution in 
cross range. 

We combine the herring shoaling activity on Georges Bank, where massive 
and highly dense shoals occur predominantly during the night on the northern 
flank”*!!-14, with that for the region to the north of the Bank between Rodgers and 


Georges Basins, where we observe less persistent herring groups forming throughout 
the diel cycle!>. The areal fish population density (in fish/m) time series shown 
Fig. 4 and Extended Data Fig. 7 is obtained by averaging the daily herring 
shoaling areal population density time series initially calculated in 15 min bins, 
then averaged over a 1.25 h running window corresponding to the half power 
width quantifying the temporal correlation scale of the fish shoaling density time 
series. 

The temporal correlation coefficients rym,fish in Extended Data Table 1 quantify 
the degree of similarity between the diel MM call rate time series cyy (f) and the 
diel fish areal population density time series ngsp(t) and are calculated!!! via 


Whe i(Cmaa (te) — Ghana) (is (th) — Ap) 
LOM (coa te) = Gra)” [TRL (te) — A)? 


using measurements cyyy (t;) and cypy (tx) that are independent for j + k, so that 
te44 — t,=1.25 h. Similarly, the temporal correlation coefficient ryyvn,mu between 
the diel call rate time series of two distinct MM species is calculated via, 


TMM, fish = 


(1) 
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™MM},MMq] = 


(2) 


where the samples at time t; and t are independent for j + k, so that ti, — 
= 1.25h. 

The cumulative MM call rate distributions in Fig. 3b and Extended Data Fig. 8 

are plotted as functions of decreasing distance from shoaling herring during night 
and day respectively, and so take the value 0 at long ranges from herring shoaling 
locations and monotonically increase to 1 at herring shoaling locations. The prob- 
ability density function (PDF) for MM call rate density as a function of range from 
shoaling herring can be obtained as the absolute value of the range—derivative of 
the cumulative call rate distribution. The e-folding decay range of the cumulative 
call rate distribution for each MM species is the distance from herring shoals where 
the cumulative call rate distribution decays to 1/e = 0.37, so that 63% of vocaliza- 
tions from that species are contained within the e-folding decay range. 
On the high temporal correlation between blue whale vocalization rate and 
herring shoaling density. Besides the fish-feeding hypothesis, another possible 
explanation is that both blue whales and herring are responding to a common 
environmental stimulus, such as changing light level or their common prey- 
zooplankton abundance. However, the annual biological sampling of the autumn 
season spawning herring over multiple years, including this 2006 experiment time 
period, indicates that the herring have largely empty stomachs and are generally 
not feeding while engaged in spawning activities». 

Previous studies of diel dependence of blue vocalizations in their zooplankton- 
feeding areas found higher call rates in the night than day for longer duration 
(approximately 20s) type B calls, associated with mating and social interaction, and 
proposed an inverse relationship between call rate and level of feeding activity’. 
It is also possible that the type D calls measured here are related to increased 
social interaction between blue whales at night. 

The Supplementary Information provides more details on marine mammal, 
fish and zooplankton distributions in GOM and comparison to other ocean 
environments. 
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Extended Data Figure 1 | Spectrograms of MM vocalizations. a~h, Beamformed spectrograms of typical repetitive vocalizations from diverse MM 
species observed using the POAWRS receiver array in the Gulf of Maine from 19 September to 6 October 2006. 
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Extended Data Figure 2 | Coherent array processing enhances SNR. vocalization from a humpback individual roughly 35 km away from the 
a, b, Compare single hydrophone measured spectrogram (a) with POAWRS receiver array recorded on 2 October 2006 at 23:48:45 EDT is 
spectrogram after coherent beamforming (b) with 64-element enhanced by 18 dB above the background noise after beamforming in b 


sub-aperture of POAWRS 160-element hydrophone array. The song where whale bearing is —64.16° from array broadside. 
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Extended Data Figure 3 | Pitch-tracks of baleen and toothed whales. calls have some overlapping bandwidth, they can be well separated using 
a, Pitch-tracks of repetitive mysticete vocalizations in the 10 to 100 Hz the upper frequency fy and slope df/dr features (Extended Data Table 2). 
range. Thick solid curves are the means of roughly 500 to 1,000 b, Mean pitch-track and instantaneous bandwidth of repetitive odontocete 
vocalizations of each type. Mean instantaneous bandwidth of the pitch- downsweep vocalizations in the 1 to 4kHz range. 
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Extended Data Figure 4 | Daily POAWRS measured MM vocalization respect to the instantaneous spatial locations of the receiver array centre. 
bearings. a, b, MM vocalization bearings from diverse species measured The techniques used here for resolving source bearing ambiguity about the 
by POAWRS receiver array on 1 October 2006 (a) and 2 October 2006 (b). horizontal line-array axis are provided in Methods section 3. The shaded 
The bearings are measured from true North in clockwise direction with bars on the x axis indicate the operation time periods of the receiver array. 
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Extended Data Figure 5 | Daily humpback vocalization repertoire. a—c, Bearings and repertoire of humpback vocalizations measured by POAWRS 
receiver array on 1 October 2006 (a), 2 October 2006 (b), and 3 October 2006 (c). The ‘meow; ‘bow, and ‘feeding’ call characteristics are provided in ref. 7. 
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Extended Data Figure 6 | Diel Atlantic herring shoaling areal 
population densities. Measured herring shoaling areal population 
densities (ranging from 0.2 fish per m? to over 10 fish per m*) determined 
from OAWRS!™9 survey in the Gulf of Maine from 26 September 2006 to 
6 October 2006, upon calibration with tens of thousands of coincident and 
co-located conventional ultrasonic fisheries echosounding measurements, 


combined with trawl sampling for identity and biological-physiological 
characterization of fish populations'*!°, The mean diffuse herring density 
of 0.053 fish per m? is determined from conventional ultrasonic fisheries 
echosounding. The bathymetric data (contours shown in grey) were 
obtained from the US National Centers for Environmental Information. 
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Extended Data Figure 7 | Diel MM call rate and herring shoal areal obtained from averaging the time series over multiple diel cycles from 
population density time series. Mean diel call rates for sei whales and 26 September 2006 to 6 October 2006. The period from roughly 2-6 EDT 
odontocetes in general are not correlated to the diel Atlantic herring shoal _ contains a data gap. 
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Extended Data Figure 8 | Cumulative diurnal MM call rate distribution. | (humpback), 22.5 to 0km (minke), 11.2 to 8.1 km (sei), and 22.4 to 5.5km 


Cumulative diurnal MM vocalization rate distribution and azimuthally- (odontocetes). The percentage of vocalizations that fully overlap with 
averaged POAWRS MM POD as a function of minimum distance from herring shoaling areas increase from day to night by 0% to 18% (blue), 
diurnal herring shoaling areas. The e-folding distances of the cumulative 14% to 40% (fin), 6% to 44% (humpback), 0% to 71% (minke), and 5% to 
MM vocalization rate distributions decrease from day (shown here) to 24% (sei), but decrease by 36% to 29% (odontocetes). 


night (in Fig. 3b) by 27.3 to 7km (blue), 9.3 to 3.9km (fin), 51.7 to 3.5km 
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Extended Data Table 1 | MM species daily call rate and temporal correlations 


MM Cum °MM,,MMzz 
: Cy M night 
species calls/day | Cyrazaay | MMfish 
blue | fin |humpback)minke| sei | odontocetes 

blue 470 10 0.91 1 | 0.67 0.78 0.48 |-0.17 -0.46 

fin 14,000 Zu 0.82 | 0.67} 1 0.85 0.79 | 0.02 -0.27 

humpback 2,000 10 0.87 | 0.78 | 0.85 1 0.84 |-0.29 -0.30 

minke 690 5 0.64 | 0.48 | 0.79 0.84 1 |-0.18 -0.20 

sei 440 0.77 -0.11  |-0.17| 0.02 -0.29 -0.18 | 1 0.44 

odontocetes| 3200 0.83 -0.42 |-0.46|-0.27| -0.30 -0.20 | 0.44 1 
The temporal correlation rymiish of MM vocalization rate time series to fish shoa 


ing areal population density time series, as well as the temporal correlation rymi, mm between distinct MM species 
vocalization rate time series over the diel cycle are calculated using Methods equations (1) and (2) respectively. 
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Extended Data Table 2 | Large baleen whale repetitive vocalization pitch-track features 
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Vocalization characteristics in the 10 to 100 Hz range from large baleen whales detected with POAWRS receiver array in the Gulf of Maine in autumn 2006. The characteristics are the lower f., 
upper fu, and mean f frequencies, mean instantaneous bandwidth 8, relative instantaneous bandwidth B/f duration r, slope df/dr, and curvature d?f/dr?. The slope and curvature are obtained from 


second order nonlinear curve-fit to the vocalization traces obtained via pitch-tracking. 
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Sensory experience regulates cortical inhibition by 
inducing IGF1 in VIP neurons 


A.R. Mardinly"*, I. Spiegel**, A. Patrizi?, E. Centofante®, J. E. Bazinet?, C. P Tzeng?, C. Mandel-Brehm?, D. A. Harmin’, 


H. Adesnik!, M. Fagiolini? & M. E. Greenberg? 


Inhibitory neurons regulate the adaptation of neural circuits to 
sensory experience’, but the molecular mechanisms by which 
experience controls the connectivity between different types of 
inhibitory neuron”® to regulate cortical plasticity are largely 
unknown. Here we show that exposure of dark-housed mice to 
light induces a gene program in cortical vasoactive intestinal 
peptide (VIP)-expressing neurons that is markedly distinct from 
that induced in excitatory neurons and other subtypes of inhibitory 
neuron. We identify Igf1 as one of several activity-regulated genes 
that are specific to VIP neurons, and demonstrate that IGF1 
functions cell-autonomously in VIP neurons to increase inhibitory 
synaptic input onto these neurons. Our findings further suggest that 
in cortical VIP neurons, experience-dependent gene transcription 
regulates visual acuity by activating the expression of IGF1, thus 
promoting the inhibition of disinhibitory neurons** and affecting 
inhibition onto cortical pyramidal neurons. 

To explore how sensory experience affects gene expression in VIP 
neurons, we examined this process in the visual cortex of adult mice 
that were housed in standard conditions, in complete darkness (that is, 
dark-housed), or dark-housed and then exposed to light for increasing 
amounts of time®” (Fig. 1a). Light deprivation for as little as 12h drives 
robust gene expression after light exposure, and increasing durations of 
dark-housing accentuate the gene induction response (Extended Data 
Fig. 1a) irrespective of the phase of the circadian rhythm (Extended 
Data Fig. 1b). To purify RNA selectively from VIP-expressing and other 
inhibitory neuron subtypes, we generated mice that were heterozygous 
for alleles of either Vip-cre, Sst-cre or Pv-cre, and were also heterozygous 
for the Rpl22-HA (RiboTag) allele’, which expresses a haemagglutinin 
(HA)-tagged ribosomal protein specifically in Cre-expressing neurons 
(Fig. 1a). For purposes of comparison, we also purified ribosome- 
bound RNA from excitatory and inhibitory neurons, labelled by Emx1-cre 
or Gad2-cre. 

By quantitative real-time PCR (qPCR), we find that messenger RNAs 
for cell-type-specific marker genes are highly enriched in the appro- 
priate samples (Extended Data Fig. 1c) and that light exposure induces 
the expression of early-response genes in each Cre line (Extended Data 
Fig. 1d). To quantify experience-induced gene expression at a genome- 
wide level, we performed RNA-seq on RNA isolated from the dark-housed/ 
light-exposed RiboTag-mice (Supplementary Table 1) (Fig. 1b, c and 
Extended Data Fig. 2a, b). This analysis identified genes which exhib- 
ited reproducible changes in expression levels in response to visual 
stimulation in at least one Cre line (n = 602; see Supplementary Table 2 
and Methods) and thus allowed us to ask how levels of these experience- 
regulated genes are correlated across the different neuronal subtypes 
compared to non-regulated genes (n = 13,678) (Fig. 1d-i). We found 
that the expression of experience-regulated genes is remarkably dissim- 
ilar across different neuronal subtypes when compared to genes that 
are not regulated by sensory experience (irrespective of differences in 


the number or expression levels of experience-regulated genes; Fig. 1f-i 
and Extended Data Fig. 3a-d). While unique subsets of experience- 
responsive genes were identified in each neuronal subtype (Fig. 2a, b 
and Extended Data Fig. 3e, f), VIP neurons are the most responsive 
to sensory stimulation (Fig. 1d, e) and possess an experience-induced 
gene expression program that is markedly distinct from the other neu- 
ronal subtypes analysed (Fig. 1h, i). This suggests that in VIP neurons 
the experience-dependent gene program may have a unique function 
in adapting the cortex’s neural circuits to sensory experience. 

We hypothesized that experience-regulated genes that are specifi- 
cally expressed and selectively regulated in VIP neurons are likely to 
have important functions in regulating the synaptic connectivity onto 
VIP neurons. Thus, we first identified the mRNAs that are specifi- 
cally enriched in each subtype (Extended Data Fig. 4a; Methods) and 
cross-referenced these genes with the list of experience-regulated genes 
(Extended Data Fig. 4b). This analysis identified 31 genes that are both 
cell-type-specific and experience-regulated, 11 of which are specific 
to VIP neurons (Supplementary Table 4). Notably, secreted molecules 
are significantly over-represented in this gene set (GO-term ‘Secreted’ 
P=0.002) and each type of neuron has its own set of cell-type-specific 
experience-regulated secreted factors, including four experience- 
induced secreted molecules that are specific to VIP neurons (Igf1, Crh, 
Prok2, Fbin2; Fig. 2b, Supplementary Table 4). 

We next performed fluorescent in situ hybridization (FISH) on sec- 
tions of visual cortices of dark-housed/light-exposed mice to quantify 
the percentage of cells that co-express an inhibitory subtype marker and 
the respective secreted factor (Fig. 2c-f). Of the four secreted factors, 
Igf1 is the one factor that is expressed in the vast majority of VIP 
neurons, and whose expression is highly enriched in these neurons 
(Fig. 2d). We were unable to reliably identify Fbln2-expressing cells, and 
Prok2 was expressed nearly exclusively in a sparse subpopulation of VIP 
neurons (Fig. 2f), consistent with the low expression level of these genes 
in the RiboTag-seq experiments (Extended Data Fig. 4c). While the 
FISH analysis revealed that in the cortex Crh is highly enriched in VIP 
neurons compared to PV and SST neurons (Fig. 2e), this gene is also 
expressed in Py-/Sst-/ Vip-negative cortical interneurons’. Since IGF1 is 
the sole experience-induced secreted factor that is selectively expressed 
in most VIP neurons, we focused our subsequent analysis on IGF1. 

Previous reports have suggested that IGF1 is synthesized in the 
cortex'®, but the function of cortical-derived IGF1 was unknown. 
Global disruption of the Igf1 gene results in abnormally small animals 
with smaller brains that contain smaller neurons with dendrites that are 
less branched and contain fewer synapses!!"}3, and the effects of IGF1 
on brain development and function are due at least in part to IGF1 
that is produced by non-neural tissues and then enters the brain’. To 
investigate specifically VIP neuron-derived IGF1, we crossed Vip-cre 
mice to both IGF1 conditional-knockout mice’ and Cre reporter mice. 
Disruption of Igf1 specifically in VIP neurons had no effect on the 
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thickness of the cortical layers, on the number and layer distribution 
of VIP neurons, or on the size of VIP neuron cell bodies at postnatal 
day 21 (that is, P21) (Extended Data Fig. 5a—d). To test whether VIP 
neuron-derived IGF] affects excitatory and/or inhibitory inputs to VIP 
neurons, we recorded miniature inhibitory or excitatory postsynaptic 
currents (mIPSCs or mEPSCs) in VIP neurons in acute visual cortex 
slices; we found that conditional deletion of Igf1 in VIP neurons leads to 
a significant reduction in mIPSC frequency (Fig. 3a) but not amplitude 
(Fig. 3b). Since conditional deletion of Igfl had no effect on the fre- 
quency or amplitude of excitatory mEPSCs on VIP neurons (Fig. 3c, d), 
these findings suggest that VIP-neuron-derived IGF1 specifically 
enhances inhibitory synaptic input onto VIP neurons. 

To test whether IGF1 functions cell-autonomously to regulate 
inhibitory input onto the cell from which it is expressed, we used a 
virus-based approach to acutely knockdown Igf1 expression in only 
a few VIP neurons. We generated short hairpin RNA (shRNA) con- 
structs against Igfl (Extended Data Fig. 6a, b), injected low titre AAVs 
expressing the shRNA and Cre-dependent enhanced green fluorescent 
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protein (eGFP) into the visual cortex of P14-15 Vip-cre mice, and 
recorded mIPSCs and mEPSCs one week later (P20—P22) in eGFP- 
positive VIP neurons that are surrounded by non-infected VIP neurons 
(Fig. 3e). This sparse and acute knockdown of Igf1 in VIP neurons 
using either of two distinct shRNAs against Igf1 resulted in a marked 
reduction in mIPSC frequency and amplitude as compared to VIP neu- 
rons infected with a control shRNA (Fig. 3f, g), but had no significant 
effect on mEPSCs (Fig. 3h, i). These effects are not due to altered VIP 
neuron morphology (Fig. 3j and Extended Data Fig. 6c), indicating 
that VIP-neuron-derived IGF1 acutely promotes inhibition onto VIP 
neurons in a cell-autonomous manner. 

To determine if VIP-neuron-derived IGF1 regulates inhibitory inputs 
onto other types of cortical neurons, we adopted a protocol that leads to 
widespread infection of neurons in the cortex (see Methods). Injecting 
AAVs into Vip-, Pv- or Sst-cre mice to label these cells with eGFP while 
knocking down Igf1 in VIP neurons, we recorded mIPSCs from each 
cell type and found that early knockdown of Igf1 in the cortex decreases 
mIPSCs frequency in VIP neurons, but does not affect mIPSCs onto 
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PV, SST, or pyramidal neurons (Extended Data Fig. 6e-1). Furthermore, 
ELISA-based analysis of IGF1 levels in the blood of mice whose cortices 
were injected with these viruses demonstrated that removing IGF1 from 
VIP neurons did not alter the level of serum-derived IGF1 (Extended 
Data Fig. 6d). While we formally cannot exclude that serum-derived 
IGF1 contributes to inhibition onto VIP neurons, this finding indi- 
cates that the decrease in mIPSCs in VIP neurons that express Igf1 
shRNAs is due at least in part to reduced Igfl expression in VIP neu- 
rons. Thus, VIP-neuron-derived IGF1 regulates the inhibitory inputs 
onto the VIP neuron in which it is produced, probably via local release 
from VIP neurons. Consistent with this idea, we find that the Igf1 
splice variant expressed by VIP neurons encodes an isoform of IGF1 
containing a heparin binding domain (Igf1.4; Extended Data Fig. 7a) 
that may limit the diffusion of IGF1 and facilitate its local action!®. 
We next overexpressed IGF1 in VIP neurons by injecting Vip-cre mice 
with an AAV construct that drives expression of an epitope-tagged ver- 
sion of IGF1 together with EGFP in a Cre-dependent manner (Fig. 3k 
and Extended Data Fig. 7b, c) and assessed the effect on mIPSCs and 
mEPSCs (Fig. 3l-o). We find that when overexpressed in VIP neurons, 
IGF1 selectively promotes inhibition onto VIP neurons, as it has no 
effect on mEPSCs in these cells. Likewise, ectopic expression of IGF1 
in SST and excitatory neurons (by intracortical injections into Sst- or 
Emx1-cre mice, respectively; Extended Data Fig. 7d—g) leads to a sim- 
ilar increase in mIPSC frequency in these cells. These findings raise 
the possibility that the selective expression of Igfl in VIP neurons is 
required for the proper organization and function of cortical circuits, 
as aberrant IGF1 expression could enhance inhibition indiscriminately 
within cortical circuits by signalling through IGF1 receptors that are 
ubiquitously expressed in these neurons (Extended Data Fig. 7h-j). 
The change in mIPSC frequency upon Igf1 knockdown in VIP neu- 
rons could reflect a change in the presynaptic probability of release 
and/or a reduction in inhibitory synapse number and/or strength. By 
paired-pulse stimulation we find that Igfl knockdown in VIP neu- 
rons does not significantly alter the probability of release of vesicles 
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Pv Sst Vip 
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from either inhibitory (Fig. 4a) or excitatory (Extended Data Fig. 7k) 
terminals that synapse onto VIP neurons. To test evoked inhibition, we 
co-injected Vip-cre mice with either Igfl or control shRNA AAVs and 
an AAV encoding the excitatory light-activated ion channel ReachR"”. 
Performing paired voltage clamp recordings from eGFP-positive VIP 
neurons and neighbouring pyramidal cells to control for variation in 
stimulation intensity, we found that the strength of light-evoked inhi- 
bition onto VIP neurons is decreased when Igfl expression is knocked 
down in VIP neurons (Fig. 4b). These experiments suggest that the pri- 
mary site of IGF1 action is post-synaptic and indicate that experience- 
dependent activation of Igfl expression increases the number and/or 
strength of functional inhibitory synapses that form on VIP neurons. To 
test whether this IGF1-dependent decrease in inhibition alters the fre- 
quency of action potentials in these neurons, we performed cell-attached 
recordings from eGFP-labelled VIP neurons expressing control or Igf1 
shRNAs. We find that VIP neurons lacking IGF1 fire action potentials 
at a significantly higher rate than controls (Fig. 4c). Given that VIP 
neurons disinhibit cortical circuits, it seems likely that this decreased 
firing of VIP neurons might alter how the cortex responds to sensory 
experience. To begin to investigate this possibility, we next assessed the 
effect of knocking down Igfl1 expression on visual cortex plasticity. 
Cortical inhibition regulates ocular dominance (OD) plasticity?" 
and visual acuity'®, and hyper-activation of VIP neurons drives a form 
of adult cortical plasticity”. To determine whether knocking down Igf1 
expression in VIP neurons affects visual cortex function, we inj ected 
control or Igfl shRNA AAVs into the binocular zone of visual cortices 
of P18 Vip-cre mice and recorded visual-evoked potentials between 
P28 and P32 (Fig. 4d and Extended Data Fig. 8a). Stimulation of the 
contralateral or ipsilateral eye with gratings at low spatial frequency 
elicited robust visual-evoked potentials both under control and Igf1- 
knockdown conditions (Extended Data Fig. 8b-d). Furthermore, the 
ratio between the contralateral and ipsilateral eye’s response (C:I ratio) 
was similar in the presence or absence of Igf1 (Extended Data Fig. 8e), 
indicating that basic visual cortex function is not obviously disrupted 


17 MARCH 2016 | VOL 531 | NATURE | 373 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


cKO in VIP neurons—mIPSC 


cKO in VIP neurons—mEPSC 


AAV: shRNA-Flex-eGFP. 
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Figure 3 | IGF1 promotes inhibitory inputs to VIP neurons in a cell- 
autonomous manner. a—d, Bar graph and cumulative distribution of the 
frequency and inter-event intervals of mIPSCs or mEPSCs recorded from 
Igf1 wild-type (WT) or conditional-knockout (cKO) VIP neurons (mIPSC 
frequency, P= 0.046; amplitude, P= 0.3; mEPSC frequency, P = 0.44; 
amplitude, P= 0.9, Mann-Whitney U-test). e, Example image of sparsely 
infected VIP neurons upon injection of AAV-shRNA-hUbc-Flex-eGFP 
into mice expressing tdTomato in all VIP neurons (white box indicates the 
magnified area; arrows indicate infected VIP neurons; arrowheads indicate 
non-infected VIP neurons; scale bars, 100 1m in main image; 501m in 
magnification). f-i, Bar graph and cumulative distribution of mIPSC/ 
mEPSC frequency, inter-event interval and amplitude recorded from 

VIP neurons sparsely infected with control or Igfl shRNAs (mIPSC 
frequency: shRNA 1, P = 0.05; shRNA 2, P=0.042; mIPSC amplitude: 
shRNA 1, P=0.004; shRNA 2, P=0.001; mEPSC frequency, shRNA 1, 


upon Igf1 knockdown. Remarkably, when we assessed visual acuity of 
the contralateral eye by increasing the spatial frequency of the gratings 
presented, the mice injected with AAVs expressing Igfl shRNA exhib- 
ited significantly increased visual acuity as compared to mice injected 
with control AAVs (Fig. 4g). 

To test whether the effect of Igfl knockdown is experience-dependent, 
we next monocularly deprived mice for a brief period of time, begin- 
ning at the peak time of ocular dominance plasticity (that is, at P26-28, 
Fig. 4d). After four days of monocular deprivation, we recorded visual- 
evoked potentials from the visual cortex contralateral to the deprived 
eye and quantified the C:I ratio upon stimulation at low spatial 
frequency as well as the visual acuity upon stimulation of the deprived 
eye. Brief monocular deprivation led to a reduction in the C:I ratio 
in mice injected with AAVs expressing either control or Igfl shRNA; 
this is a consequence of the reduction in the contralateral response 
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P=0.13; shRNA 2, P= 0.07; mEPSC amplitude: shRNA 1, P=0.77; 
shRNA 2, P=0.44. Mann-Whitney U-test). j, Sholl analyses of VIP 
neurons infected with control or Igfl shRNA (P = 0.76, two-way repeated- 
measures ANOVA). k, Expression of epitope-tagged IGF 1.4 in VIP- 
neurons. Cortices of P3 Vip-cre/+ mice were injected with AAVs driving 
Cre-dependent expression of SSHA-IGF1.4-Myc-F2A-eGFP (top) or 
F2A-eGFP (bottom) and stained at P20 for HA (red) and eGFP (green) 
(Scale bar, 101m). l-o, Bar graphs and cumulative distribution plots 
showing mIPSC/mEPSC amplitude and frequency/inter-event interval 
in VIP neurons infected with a control AAV or an AAV over-expressing 
(OE) IGF1. mIPSC: amplitude, P= 0.05; frequency, P = 0.02; mEPSC: 
amplitude, P= 0.55; frequency, P=0.86. Mann-Whitney U-test. 

a-d, f-j, |-o, Numbers inside bars indicate the number of cells recorded; 
*P <0.05 by Mann-Whitney U-test. 


(Extended Data Fig. 8b-e)'*. Notably, when we tested visual acuity after 
brief monocular deprivation, both Igf1 and control shRNA injected 
mice exhibited similar levels of amblyopia (that is, loss of visual acuity) 
in the deprived eye (Fig. 4e-g), despite the higher visual acuity in 
the Igfl shRNA injected mice that were not monocularly deprived 
(Fig. 4e-g). These findings indicate that VIP neuron-derived IGF1 
regulates visual acuity in an experience-dependent manner and may 
function as a sensory-dependent brake on cortical plasticity. The 
observation that in response to sensory experience IGF1 in VIP neu- 
rons controls inhibition, taken together with the previous finding that 
experience induces BDNF in excitatory neurons to regulate excitatory- 
inhibitory balance”!”’, suggests a model in which each type of neuron 
within a cortical circuit expresses a unique set of experience-induced 
secreted factors that control specific synaptic inputs onto the neuron 
and plasticity within a neural circuit®”. 
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Figure 4 | VIP-neuron-derived IGF1 regulates VIP neuron function and 
regulates visual acuity in an experience-dependent manner. a, Paired-pulse 
recordings from VIP neurons infected with control or Igfl shRNA (P= 0.96, 
two-way ANOVA). b, Left, average traces of light-evoked IPSCs (eIPSC) from 
paired recordings of VIP neurons infected with control or Igfl shRNA (green 
traces) and neighbouring pyramidal neurons (Pyr., black traces). Right, 
quantification of eIPSC amplitude of the VIP neuron after infection with 
AAVs expressing control or Igfl shRNA, normalized to the eIPSC amplitude 
of the paired pyramidal neuron (P=0.01, Mann-Whitney U-test). c, Average 
firing rate of VIP neurons infected with Igf1 or control shRNA (P= 0.04, 
Mann-Whitney U-test). b, c, Numbers inside bars indicate the number of 
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METHODS 


No statistical methods were used to predetermine sample size. 

Visual stimulation. For calibrating the duration of the dark housing period before 
light exposure, C57B16 wild-type mice were housed in a standard light cycle until 
they were placed in constant darkness for varying amounts of time before anal- 
ysis at postnatal day 56. At P56, all mice were either sacrificed in the dark (dark- 
housed condition) or light-exposed for 1, 3, or 7.5h before being sacrificed. The 
eyes of all animals were enucleated (for the dark-housed condition, enucleation was 
performed in the dark) before dissection of the visual cortex in the light. 

For RiboTag-experiments, mice were reared in a standard light cycle and then 

housed in constant darkness for two weeks starting from P42; at P56, all mice were 
either sacrificed in the dark (dark-housed condition) or light-exposed for 1, 3, or 
7.5h before being sacrificed. Additional cohorts of mice for the ‘standard’ condition 
were housed in a standard light cycle until P56 when they were euthanized. The 
eyes of all animals were enucleated (for the dark-housed condition, enucleation was 
performed in the dark) before dissection of the visual cortex in the light. 
RNA isolation, reverse transcription, qPCR analysis. Total RNA was extracted 
with TRIzol reagent (Sigma) according to the manufacturer’s instructions, and 
RNA quality was assessed on a 2100 BioAnalyzer (Agilent); all RNAs were treated 
with DNasel (Invitrogen) before reverse transcription. For the cloning of ribopro- 
bes, total RNA was extracted from whole adult C57B16 wild-type mouse brains 
and cDNA was prepared using SuperScript II kit (Life Technologies). For real- 
time quantitative PCR experiments aimed at calibrating the duration of the dark 
housing period, total RNA was extracted for each sample from the visual cortices 
of one animal. For real-time quantitative PCR experiments aimed at testing the 
efficacy of shRNA constructs directed against Igf1, total RNA was isolated from 
two pooled 24 wells of cultured cortical neurons for each condition. For qPCR 
experiments, RNA was reverse-transcribed with the High Capacity cDNA Reverse 
Transcription kit (Life Technologies). Real-time quantitative PCR reactions were 
performed on the LightCycler 480 system (Roche) with LightCycler 480 SYBR 
Green I Master. Reactions were run in duplicates, triplicates or quadruplicates, and 
B-actin (Actb) or 83-tubulin (Tubb3) levels were used as an endogenous control for 
normalization using the AAC, method”, Real-time PCR primers were designed 
using the Universal ProbeLibrary (Roche) as exon-spanning whenever possible and 
answered the following criteria: linear amplification over three orders of magnitude 
of target concentration, no amplification product in control samples that were not 
reverse-transcribed (that is, control for contamination with genomic DNA), no 
amplification product in control samples where no template was added (that is, 
control for primer dimers), amplification of one singular product as determined 
by melt-curve analysis and analysis of the product in agarose gel electrophoresis 
and sequencing of the PCR product. The qPCR primers used in this study are listed 
in Supplementary Table 6. 

For analysis of light-induced gene expression in wild-type mice, the gene 
expression levels were analysed in four mice (two males and two females) at 
each time point. The data were calculated as fold change relative to the average 
of the overnight dark-housed condition and normalized to the average of the 
maximally induced time point. Data in figures represent the mean and s.e.m. 
of four mice. 

For assessing Igf1 levels in cortical cultures infected with shRNA-expressing len- 
tiviral constructs, GPCRs were performed in quadruplicates for each condition and 
fold changes were calculated relative to the non-infected non-stimulated cultures. 
Data were normalized to the maximally induced condition in each biological repli- 
cate, and data in figures represent the mean and s.e.m. of three biological replicates. 
RiboTag-purifications, RiboTag-qPCR and RiboTag-seq. Immunoprecipitation 
and purification of ribosome associated RNA was performed essentially as 
described®*, with minor modifications: lysis of the samples was performed in 
the presence 10 mM Ribonucleoside Vanadyl Complex (NEB, Ipswich, MA), and 
immunoprecipitation was performed with a different anti-HA antibody (HA-7, 12 1g 
per immunoprecipitation, Sigma). In brief, the visual cortices were dissected, 
flash frozen in liquid nitrogen and then kept at —80°C until further processing. 
Visual cortices from three individual animals (each sample contained both male 
and female animals) were pooled for each biological replicate, and three biological 
replicates were performed. After lysis of the tissues and before immunopurifi- 
cation, a small fraction of lysate of each sample (that is, ‘input’) was set aside 
and total RNA was extracted with TRIzol reagent followed by the RNEasy Micro 
Kit’s procedure (Qiagen, Valencia, California). After immunopurification of the 
ribosome-associated RNAs, RNA quality was assessed on a 2100 BioAnalyzer 
(Agilent, Palo Alto, California) and RNA amounts were quantified using the Qubit 
2.0 Fluorometer (Life Technologies). Only samples with RIN numbers above 8.0 
were considered for analysis by qPCR and RNA-seq. For all RNA samples of 
sufficient integrity, 5-10 ng of RNA were SPIA-amplified with the Ovation RNA 
Amplification System V2 (NUGEN, San Carlos, California), yielding typically 
5-8 jg of cDNA per sample. 


Quantitative RT-PCR was performed as described above and relative expression 

levels were determined in every experiment by normalizing the Ct-values to those 
of beta-Actin (ActB) from the 0 h input using the AAC, method”. To determine 
the fold-enrichment (IP/Input), the actin-normalized expression levels for every 
time point of every biological replicate were averaged, and the grand averages from 
the IP and Input were divided to find the IP/Input ratio. To calculate fold-induction 
for each biological replicate, each time point was divided by the maximal value 
occurring in that biological replicate, such that the maximal value was set to 1 in 
each biological replicate. The mean and standard error were calculated at each time 
point from these normalized values. All samples were analysed by qPCR for purity 
and light-induced gene expression before analysis by high throughput sequencing. 
RNA-seq and analysis. SPIA-amplified samples from RiboTag- 
immunoprecipitated fractions for each of the five stimulus conditions and each 
of the five Cre lines were prepared as described above and processed in triplicate 
(75 samples total). For preparing sequencing libraries, 2 \.g of each amplified CDNA 
were fragmented to a length of 200-400 bp using a Covaris S2 sonicator (Acoustic 
Wave Instruments) using the following parameters: duty cycle: 10%, intensity: 
5, cycles per burst: 200, time: 60 s, total time: 5 min. After validating the frag- 
ment length of the sonicated cDNA using a 2100 BioAnalyzer (Agilent, Palo Alto, 
California), 2 \1g of the fragmented cDNA were used for sequencing library prepa- 
ration using the PrepX DNA kit on an Apollo 324 robot (IntegenX). The quality of 
completed sequencing libraries was assessed using a 2100 BioAnalyzer (Agilent, 
Palo Alto, California) and the completed libraries were sequenced on an Illumina 
HiSeq 2000 instrument, following the manufacturer’s standard protocols for 
single-end 50 bp sequencing with single index reads. Sequencing typically yielded 
30-80 million usable non-strand-specific reads per IP sample. Reads were mapped 
to the mm9 genome using TopHat (v.2.0.13) and Bowtie (2.1.0.0)”°. On average, 
~70% of mapped IP reads were uniquely mapped to the mm9 genome allowing 
for 0 mismatches and were therefore assignable to genic features (one RiboTag-seq 
library (Sst-cre, standard-housing, biological replicate 2) was excluded from anal- 
ysis due to low mappability). Values from all IP libraries were normalized using 
Cufform (v.2.2.1), and values from the Cuffnorm output file ‘genes-Count_Table’ 
(normalized reads) were taken as a proxy for gene expression. P values were gener- 
ated for each Cre line for each dark-light conditions using Cuffdiff (v.2.2.1) using 
the time series (-T) flag based on three biological replicates. 
Identification and classification of experience-regulated transcripts. To identify 
transcripts regulated by visual experience, for each biological replicate of each Cre 
line, the fold change in normalized reads was calculated for each gene at every 
time point (dark-housed/standard-housed, 1 h light/dark-housed, 3 h light/ 
dark-housed; 7.5 h light/dark-housed). Genes were flagged as experience-regulated 
in a given Cre line if they met the following conditions in at least one sample: (1) 
P value <0.005, (2) mean fold change of twofold or greater, (3) fold changes of 
2 or higher in 2 of 3 biological replicates, (4) the mean expression value in at least 
one sample must be above absolute expression threshold (set at the 40th percentile 
of all observed values). 

To determine in which Cre lines genes were regulated by experience, genes 
were simply classified according to the above criteria. However, for this analysis 
we excluded the Gad2-cre line, since Pv-, Sst- and Vip-cre all label subsets of the 
neurons labelled by Gad2-cre. However, we did detect genes regulated solely in 
Gad2-cre, but no other Cre lines; we reasoned that these genes are probably regu- 
lated by experience in a population of 5HT3aR*/VIP~ neurons that are contained 
in Gad2-cre but none of the other Cre lines. 

We classified the set of experience-regulated genes into categories ‘early’ ‘late, 
and ‘long-term’ based on the fastest kinetics observed. When genes were found 
to be elevated and/or suppressed at multiple time points, we assigned them to the 
categories based on the most rapid observed change. For example, while Fos levels 
are elevated over dark housing at 1, 3 and 7.5 h of light exposure and suppressed 
after two weeks of dark housing, Fos is classified as ‘early-up’ because it is elevated 
at 1 h after light exposure. 

Linkage analysis. All linkage analysis was performed using the ‘single’ method and 
‘Cityblock metric using Matlab’ linkage function. To determine the branch-order 
significance of the cladogram resulting from clustering of the 602 experience- 
regulated genes, we generated 1,000 cladograms from 602 sets of random expressed 
genes (including experience-regulated genes, with replacement) and asked how 
often we generated a cladogram with an identical branch order at the level of the 
Cre lines. Only 11 sets of 1,000 random genes sets generated an identical tree. For 
the purposes of this analysis, we only compared the branches above the level of 
the individual Cre line. 

Identification of cell-type-specific transcripts. To identify cell-type-enriched 
transcripts, an enrichment score was calculated for every transcript in every 
Cre line for each biological replicate. This enrichment score was calculated by 
dividing the maximum expression value observed in a given Cre line by the 
maximum expression value observed across all conditions for all other Cre lines 
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(GABAergic subtypes were not required to be enriched above Gad2-cre). The 
enrichment scores for a set of known cell-type-specific genes were evaluated (Vglut, 
Tbr, Pvalb, Sst, Vip), and our threshold was set at the enrichment score of the 
cell-type-specific gene with the lowest score (Slc17a7/Veglut1, at 5.5-fold-enriched 
in Emx1-cre). Transcripts were considered to be expressed in a cell-type-specific 
manner (or ‘highly enriched’) in a given Cre line if their mean enrichment score 
was above this threshold and if the enrichment score exceeded this threshold in 
2 out of 3 biological replicates. 

Cloning of riboprobes, knockdown and expression constructs. Cloning of all 
constructs was done using standard cloning techniques, and the integrity of all 
cloned constructs was validated by DNA sequencing. Templates for the riboprobes 
for Igfl, Gad1, Pvalb, Sst and Vip were prepared by PCR-amplification of cDNA 
fragments generated from total RNA isolated from adult C57B16 mouse brains (see 
Supplementary Table 7 for primer sequences) and cloning of the respective PCR 
fragments into the pBlueScript II vector (Agilent Technologies). 

Lentiviral shRNA constructs were generated by cloning shRNA stem loop 
sequences against Igfl (Igfl shRNA 1: GGTGGATGCTCTTCAGTTC; Igfl 
shRNA 2: TGAGGAGACTGGAGATGTA) and Luciferase (Luc, control: 
ACTTACGCTGAGTACTTCG) into a modified version of pLentiLox3.7”° in 
which the CMV promoter driving the expression of eGFP was replaced with an 
hUbc promoter and in which the loxP sites surrounding the hUbc-eGFP cassette 
were removed. The loop sequence used in these shRNA constructs is based on 
miR-25 (CCTCTCAACACTGG)”’”. shRNA-expressing AAV-constructs (pAAV- 
U6-shRNA-hUbc-Flex-eGFP) were made by first replacing the Flex-GFP-Gephyrin 
cassette in pAAV-Flex-GFP-Gephyrin” with a Flex-eGFP cassette (resulting in 
pAAV-hUbc-Flex-eGFP) and then transferring the U6-shRNA cassettes from the 
pLentiLox constructs to pAAV-hUbc-Flex-eGFP. 

AAV constructs for the Cre-conditional co-expression of epitope-tagged IGF1.4 
and eGFP or of eGFP alone were cloned by synthesizing the gBlocks (Integrated 
DNA Technologies) and using the gBlocks as templates for PCR amplification; 
the respective PCR products were then cloned into the pAAV-hUbc-Flex-eGFP 
(see above) by replacing the EGFP with the respective insert. This strategy 
yielded plasmids termed pAAV-hUbc-Flex-SSHA-IGF1.4-Myc-F2A-eGFP and 
pAAV-hUbc-Flex-F2A-eGFP, whereby the Cre-dependent inserts were driven 
by a human ubiquitin promoter (hUbc). The sequence for Igf1.4 was based on 
NM_001111275 (base pairs 277-752) and was modified in the following way: an 
HA epitope (TATCCtTATGATGTTCCAGATTATGCT) was inserted in frame 
between the Igf1.4 signal sequence and the beginning of the coding sequencing 
(cds) of Igfl.4, Igfl.4 was rendered resistant to the shRNA against Igfl by 
introducing silent mutations into the target sequences specified above (sh1: 
TGTTGACGCGCTCCAATTT; sh2: TACGCCGGT TAGAAATGTA) and the 
followings tags were inserted in frame 3’ to the Igfl.4 coding sequencing: Myc 
epitope (GAACAAAAACTCATCTCAGAAGAGGATCTG), Furin cleavage site 
(CGGGCCAAGCGG) and a 2A peptide (GGCAGTGGAGAGGGCAGAGGA 
AGTCTTCTAACATGCGGTGACGTGGAGGAGAATCCCGGCCCT). The 
sequence for eGFP was based of the published sequence of eGFP. For pAAV- 
hUbc-Flex-F2A-eGFP a gBlock was synthesized containing the Furin cleavage site 
followed by the 2A site and eGFP. Detailed sequences are available upon request. 
Double-fluorescent ISH. For double-fluorescent in situ hybridization (FISH), 
wild-type C57Bl6 mice were dark-housed and light-exposed for 7.5h as described 
above. After light exposure, the brains were dissected and fresh frozen in Tissue- 
Tek Cryo-OCT compound (Fisher Scientific) on dry ice and stored at —80°C 
until use. 

FISH for Igfl was essentially done as described”*”*: riboprobes were prepared by 
in vitro transcription of linearized plasmids containing the template of the respec- 
tive probe. Riboprobes for Igf1 were labelled with UTP-11-Digoxigenin, while the 
riboprobes for the subtype markers (Gad1, Pvalb, Sst, Vip) were labelled with UTP- 
12-Fluorescein (Roche); all riboprobes were hydrolyzed to lengths of 200-400 bp 
after synthesis and validated for labelling with Dioxigenin or Fluorescein. For 
in situ hybridization, coronal sections (201m thick) of the visual cortex were cut on 
a cryostat and fixed in 4% paraformaldehyde for 10 min. Endogenous peroxidases 
were inactivated by treating the sections for 15 min in 0.3% HO) in PBS, and 
acetylation was performed as described. Pre-hybridization was done overnight at 
room temperature, and hybridization was performed under stringent conditions 
at 71.5°C. Following hybridization, stringency washes in SSC were performed 
as described at 65°C. For immunological detection of the first probe (Igf1), the 
tissue was first treated with a blocking step for 1 h in blocking buffer (B2) at room 
temperature before the anti-Digoxigenin-POD antibody (Roche) was applied at a 
concentration of 1:1000 in blocking buffer for 1 h at room temperature. Following 
three washes in buffer B1 and an additional wash in buffer TNT (0.1 M Tris-HCl 
pH 7.5, 0.15 M NaCl, 0.05% Tween20), the Igfl probe was detected by exposing 
the sections at room temperature in the dark for 20 min to TSA Plus Cy3 reagent 
(Perkin Elmer) diluted 1:100 in TSA working solution, after which the sections 
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were washed three times in TNT buffer. Before the immunological detection of 
the second probe, the peroxidases for detecting the first probe were inactivated by 
treating the sections for 30 min with 3% H Oz, followed by three washes in PBS. 
After an additional blocking step in blocking buffer for 1 h at room temperature, 
the anti-fluorescein-POD antibody (Roche) was applied at a concentration of 
1:1000 in blocking buffer overnight at 4°C. Following three washes in buffer B1 and 
an additional wash in buffer TNT, the probes of the subtype markers were detected 
by exposing the sections at room temperature in the dark for 15 min to TSA Plus 
Cy5 reagent (Perkin Elmer) diluted 1:100 in TSA working solution, after which 
the sections were washed three times in TNT buffer. Finally, the sections were 
counterstained with DAPI (4’,6-diamidino-2-phenylindole, Molecular Probes) and 
mounted using Fluoromount-G (Southern Biotech). In each experiment, controls 
for hybridization specificity were included (sense probe for Igf1) as well as controls 
for ensuring the specificity of the immunological detection of the digoxigenin- and 
fluorescein-labelled riboprobes. 

FISH for Crh, Prok2 and Fbln2 was done using the RNAscope system (Advanced 
Cell Diagnostic); this was necessary since no reliable signal could be detected 
with the method described above for Igfl FISH using DIG-labelled riboprobes. 
RNAscope probes for all genes were synthesized by ACD and all experiments were 
done according to the ACD’s protocol for fresh frozen brain sections. 

For quantifying of the expression pattern of all genes of interest (GOI, that is, 

Igf1, Crh and Prok2; Fbin2 could not be detected reliably), the visual cortices in each 
section were imaged on a Zeiss Axio Imager microscope with a 10x objective and 
3 x 5 fields-of-view were ‘stitched’ into one compound image; in all cases, image 
exposures were kept constant throughout a given experiment for each channel. 
Compound images of each visual cortex were then imported to Photoshop, and 
additional layers were created for each probe (that is, one layer for the GOI and 
one layer for the subtype marker in each compound image). The cells positive 
for each probe were then marked with a dot in the new respective layer by two 
independent investigators in a blinded manner (one investigator marking GOI- 
positive cells and the other investigator marking subtype-marker-positive cells). 
Finally, the layers containing the dots of the identified positive cells were compiled 
into a separate image file together with the DAPI-layer and imported into Image]. 
In Image], the images were analysed in a blinded manner by defining the visual 
cortex and its layers as regions of interest (ROI) based on the DAPI staining and 
quantifying the number of cells positive for either one or both markers per ROI. 
For each combination of probes (GOI together with each of the subtype markers), 
two visual cortices from four animals were analysed (a total of eight visual cortices 
for each combination). 
Virus production and neuronal cultures. Concentrated lentiviral stocks were 
prepared and titrated essentially as described*’. AAV stocks were prepared at the 
University of North Carolina (UNC) Vector Core and at the Children’s Hospital 
Boston Vector Core; see also Supplementary Table 8 for further details on AAV 
stocks. 

Primary cultures of cortical neurons were prepared from E16.5 mouse embryos 
as described®. In brief, 3 x 10° neurons per well were plated in 24-well dishes coated 
with poly-p-lysine (20 ,,gml~!) and laminin (3.4j1g ml). Cultures were maintained 
in neurobasal medium supplemented with B27 (Invitrogen), 1 mM L-glutamine, 
and 100 U ml"! penicillin/streptomycin, and one-third of the media in each well 
was replaced every other day. For testing of viral shRNA constructs, the cultures 
were infected at DIV 3 with concentrated viral stocks for 5 h at an MOI of 6. After 
infection, the cultures were washed twice in plain neurobasal medium after which 
the conditioned medium was returned to the dish and the cultures were continued 
to be maintained as described. At DIV 7, neuronal cultures were treated overnight 
with 1 1M TTX and 100 1M AP-5 to silence spontaneous activity before the cultures 
were depolarized at DIV 8 with 55 mM extracellular KCl as described® and lysed 
in TRIzol after 6 h of stimulation. 

Western blot for testing of IGF1 constructs and ELISA for determining serum 
IGF1 levels. HEK293T cells were used for testing the expression and the biological 
activity of the epitope-tagged IGF 1.4 constructs. HEK293T cells were cultured in 
DMEM (Life Sciences) containing 10% FCS and penicillin/streptomycin. Cells were 
transfected using lipofectamine (Life Technologies) and 18 h post transfection, 
the medium was replaced with DMEM containing 0.1% FCS; 42 h post transfection, 
the conditioned media were collected, spun down to remove cell debris and used 
immediately for stimulating non-transfected HEK293T that were serum starved 
for 3 days in DMEM containing 0.1% FCS. The conditioned media were applied 
to the serum starved cells for 15 min at 37°C after which the cells were lysed in 
boiling SDS sample buffer and subjected to Western blot analysis essentially as 
described®*". For detecting the (phosphorylated) IGF1-receptor, the following 
antibodies were used: anti-IGF 1-receptor-3 (D23H3) XP Rabbit mAb (#9750, Cell 
Signaling, 1:1000) and anti-phospho-IGF1-receptor-6 (Tyr1135/1136)/Insulin 
Receptor 8 (Tyr1150/1151) (19H7) Rabbit mAb (#3024, Cell Signaling, 1:1000). 
For determining serum IGF1 levels, we used the IGF1 Quantikine ELISA kit (R&D 
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Systems), following the manufacturer's instructions (P3 Vip-cre heterozygous pups 
were injected intracortically with the respective AAV and bled at P20). 
Perfusions, immunohistochemistry and morphological analysis of IGF1 cKO 
visual cortices. Mice were anaesthetized with 10% ketamine and 1% xylazine in 
PBS by intraperitoneal injection. When fully anaesthetized, the animals were tran- 
scardially perfused with ice-cold PBS for 5 minutes followed by 15 minutes of cold 
4% PFA in PBS. Brains were dissected and post-fixed for one hour at 4°C in 4% 
PFA, followed by three washes (each for 30 min) in cold PBS, and cryoprotection 
overnight in 20% sucrose in PBS at 4°C. The following day, brains were placed in 
Tissue-Tek Cryo-OCT compound (Fisher Scientific), frozen on dry ice and stored 
at —80°C. Coronal sections (201m thick) of the visual cortices were subsequently 
cut using a Leica CM1950 cryostat and used for subsequent experiments. 

For immunolabelling, the slides were blocked for 1 h with PBS containing 5% 
normal goat serum and 0.1% Triton X-100 (blocking solution). The samples were 
incubated overnight with different primary antibodies diluted in blocking solution, 
washed three times with PBS and then incubated for 45 min at room temperature 
with secondary antibodies and/or Hoechst stain (ThermoFisher Scientific). Slides 
were mounted in FluoromountG (Southern Biotech) and imaged on a Zeiss Axio 
Imager microscope. The following antibodies were used: mouse anti-HA (HA-7, 
Sigma; 1:1000), chicken anti-GFP (GFP-1020, Aves labs ; 1:1500), goat anti-mouse 
IgG (H+L) Alexa Fluor 488 (Highly Cross-Adsorbed, Life Technologies; 1:1,000), 
goat anti-chicken IgY (H+L) Alexa Fluor 488 (Life Technologies; 1:1,000). 

For analysing the brains of Igfl Vip-cre WT and cKO mice, brains of three-week- 

old WT and cKO littermates were placed on the same slide to minimize varia- 
tion. After cryosectioning, the slides were either counterstained immediately or 
stored at —20°C before they were counterstained and imaged. Counterstaining 
was done with DAPI (4’,6-diamidino-2-phenylindole, Molecular Probes) in PBS 
for 15-30 min at room temperature, after which the sections were washed once in 
PBS and mounted in Fluoromount-G (SouthernBiotech). For cell counting experi- 
ments, coronal visual cortex sections were imaged using a Zeiss Axio Imager 
microscope with a 10x objective and typically, 3 x 5 fields-of-view were ‘stitched’ 
into one compound image. In all cases, image exposures were kept constant 
throughout a given experiment for each channel. Custom ImageJ and MATLAB 
macros were used to quantify the area of each cortical layer, the number tdTomato- 
positive cells per layer, and the size of tdTomato-postive cells. Briefly, regions of 
interest (ROI) encompassing the visual cortex and its layers were defined based 
on the DAPI counterstaining. While the width of these ROIs was kept constant 
throughout the analysis of all sections, the height of the ROIs was adjusted in each 
image according to the DAPI counterstaining in each section and the areas of each 
layer in each section were recorded. For analysing the number and soma size of 
tdTomato-postive cells in each layer, a threshold for each channel was determined 
based on multiple user-defined negative regions. Channels were thresholded and 
binarized, and a mask of each channel was created. The number of tdTomato- 
positive cells was determined by taking the logical AND of the DAPI and tdTomato 
channel masks and counting the number of components greater than 4 pixels in 
size in the double overlap of the masks of the two channels in each layer ROI. The 
soma size was calculated as the area of these double-overlapping components. 
Three animals per genotype and 4-6 visual cortex sections per animal were ana- 
lysed, and these data were used to determine the mean and s.e.m. of the values 
reported for each genotype. 
Stereotaxically guided surgery and intra-cortical injections of AAV constructs. 
All surgeries were performed according to protocols approved by the Harvard 
University Standing Committee on Animal Care and were in accordance with 
federal guidelines. Surgeries were performed on mice between P14 and P15. 
Animals were deeply anaesthetized by inhalation of isoflurane (initially 3-5% 
in Oo, maintained with 1-2%) and secured in the stereotaxic apparatus (Kopf). 
Animal temperature was maintained at 37°C. The fur was shaved and scalp cleaned 
with betadine and 100% ethanol three times before an incision was made to expose 
the skull. Injections into the visual cortex were made by drilling a ~0.5 mm burr 
hole (approximately 2.7 mm lateral, 0.5 mm anterior to lambda) through the skull, 
inserting a glass pipette to a depth of 200-400 1m and injecting 250 nl of the respec- 
tive AAV construct at a rate of 100nl min“ |. Five minutes post-injection, the glass 
pipette was retracted, the scalp sutured and the mouse returned to its home cage. 
All animals were monitored for at least one hour post-surgery and at 12h inter- 
vals for the next 5 days. Post-operatively, analgesic (flunixin, 2.5 mg per kg) was 
administered at 12h intervals for 72 h. 

For neonatal injections, pups post-natal day 3-5 were anaesthetized on ice for 
2-3 min, and secured to a stage where their head was supported using a clay mould 
using standard lab tape. A bevelled glass pipette was lowered into visual cortex 
(approximately 2 mm lateral, 0.2 mm anterior to lambda), and 50 nl of the respec- 
tive AAV virus was injected at a rate of 23 nlsec!. Injections were made into eight 
sites (four on each hemisphere), and the mouse was then allowed to recover on a 
37°C warm plate before being returned to the home cage. 


For bilateral stereotaxic intra-cortical injections of AAV constructs for visual 

plasticity experiments, surgeries were performed on mice between P18 and P20. 
Animals were anaesthetized with isofluorane gas (1-2% in O2), and body tem- 
perature was maintained at around 37°C with a heating pad during surgery. The 
head was held in place by standard mouse stereotaxic frame. The fur was shaved 
and scalp cleaned with betadine and 100% ethanol three times before an incision 
was made to expose the skull. Burr holes were drilled into the skull at the point 
of injection guided by stereotaxic coordinates and blood vessel patterns (approxi- 
mately 2mm and 2.7 mm lateral, 0.5 mm anterior to lamba) on both hemispheres. 
A 28-gauge Hamilton syringe (701RN) was inserted to a depth of 200-300 1m 
and 250 nl of the respective AAV construct was injected at the rate of 50 ml min“!. 
Five minutes post-injection, the Hamilton syringe was retracted, the scalp sutured 
and the mouse returned to its home cage. All animals were monitored for at least 
one hour post-surgery. Post-operatively, analgesic (meloxicam, 5-10 mg kg!) was 
administered every 24h for 2 days. 
Electrophysiology. Coronal sections (300 1m thick) containing the primary visual 
cortex were cut from P19-P21 mice using a Leica VT1000S vibratome in ice-cold 
choline dissection media (25 mM NaHCOs, 1.25 mM NaH>PO,, 2.5mM KCl, 7mM 
MgCl, 25mM glucose, 0.5mM CaCh, 110mM choline chloride, 11.6 mM ascorbic 
acid, 3.1mM pyruvic acid). Slices were incubated in artificial cerebral spinal fluid 
(ACSF, contains 127 mM NaCl, 25 mM NaHCO, 1.25mM NaH>POg,, 2.5mM KCl, 
2.5mM CaCh, 1mM MgCh, 25 mM glucose) at 32°C for 30 min immediately 
after cutting, and subsequently at room temperature. All solutions were saturated 
with 95% O3/5% COs, and slices were used within 6 h of preparation. Whole-cell 
voltage-clamp recordings were performed in ACSF at room temperature from 
neurons in primary visual cortex that were identified under fluorescent and DIC 
optics. Recording pipettes were pulled from borosilicate glass capillary tubing with 
filaments using a P-1000 micropipette puller (Sutter Instruments) and yielded 
tips of 2-5.5 MQ resistance. All experiments were recorded with pipettes filled 
with 135 mM caesium methanesulfonate, 15 mM HEPES, 0.5mM EGTA, 5mM 
TEA-Cl, 1mM MgCh, 0.16 mM CaCh, 2mM Mg-ATP, 0.3mM Na-GTP, 10mM 
phosphocreatine (Tris), and 2mM QX-314-Cl. Osmolarity and pH were adjusted 
to 310 mOsm and 7.3 with Millipore water and CsOH, respectively. Recordings 
were sampled at 20 kHz and filtered at 2 kHz. mEPSCs were isolated by holding 
neurons at —70 mV and exposing them to 0.5 1M tetrodotoxin, 50M picrotoxin 
and 251M cyclothiazide and were blocked by application of 25 |1M NBQX and 
50\1.M CPP. mIPSCs were isolated by holding neurons at 0 mV and exposing 
them to 0.541M tetrodotoxin, 251M NBQX, and 50,1M CPP and were blocked by 
501M picrotoxin. Data were acquired using either Clampex10 or custom MATLAB 
software, using either an Axopatch 200B or Multiclamp 700B amplifier, and 
digitized with a DigiData 1440 data acquisition board (Axon Instruments) or a 
PCle-6323 (National Instruments). For measuring miniature postsynaptic currents 
(minis), cells were allowed to stabilize for at least two minutes. 

For paired pulse experiments, no drugs were used in the ACSF. A stimulating 
electrode (ISO-Flex, A.M.P..) was positioned approximately 100 1m below the cell, 
and 0.1 ms electrical pulses were given while adjusting the stimulus intensity and 
electrode position until the first pulse was between 100 and 500 pA. Inter-stimulus 
interval was varied and 10s elapsed between each sweep. Pulse amplitudes were 
obtained from average sweeps of at least ten trials. Cells were held at 0 mV to record 
IPSCs and —70 mV to record EPSCs. 

For evoked IPSCs, no drugs were used in the ACSE. Simultaneous paired whole-cell 
recordings were obtained from an eGFP-expressing VIP neuron and a morpho- 
logically identified pyramidal neuron located not more than five cell bodies away 
from the VIP neuron. Both cells were held at 0 mV, and a 5 ms light pulse from a 
blue LED (Thorlabs) was used to evoke IPSCs. Light intensity and the objective 
position were varied until the VIP neuron IPSC amplitude was between 200 and 
500 pA. Average light power at 470 nm varied from between 0.3 and 0.7 mW over 
the course of the experiment. Reported ratios were obtained by dividing IPSC 
amplitudes obtained from an average trace of at least ten trials. 

For electrophysiology experiments, n was set to min n = 10 to detect 20% effect 
size with power 0.95. For experiments to determine average firing rate of VIP neu- 
rons, a modified ACSF that promotes increased action potential firing was used 
containing, 3.5mM KCl and 0.8mM CaCl,. Cell-attached patch recordings were 
obtained from eGFP-positive cells. Cells that did not fire an action potential in the 
first 30s of recording were discarded, and recordings were maintained for at least 
30 ten-second sweeps. Average firing rate was determined from the first sweep to 
the last recorded sweep in which an action potential occurred. 

Miniature IPSC and EPSC data were analysed using Axograph X. Events were 
identified using a variable amplitude template-based strategy. Templates for each 
event type were defined as follows: mEPSC: 0.25 ms rise time, 3 ms decay 7, ampli- 
tude threshold of —3 x s.d. local noise; mIPSC: 1 ms rise time, 50 ms decay 7, ampli- 
tude threshold of 2.5 x s.d. local noise. Local noise was determined by calculating 
the standard deviation of the current in a 5 ms window before event rise onset. 
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Templates lengths extended 25 ms after rise onset in the case of mEPSCs and 50 ms 
after rise onset in the case of mIPSCs. Events were discarded if they had a rise time 
outside the range of 0-3 ms. Statistical significance for all recorded parameters 
between genotypes was evaluated using a Mann-Whitney U-test on the mean 
values from individual neurons in a given experiment. 

Minis were additionally evaluated for significance using both a Kolmogorov- 
Smirnov test (KS test) and Monte Carlo test. For these tests, 50 random minis were 
sampled from each neuron in each condition to obtain a continuous distribution 
for each condition that equally weighted each cell in that condition: these distri- 
butions are the data shown in the cumulative distribution graphs. One hundred 
random events were randomly sampled from these distributions for a KS test; 
and for Monte Carlo tests, 100 random events were randomly sampled from each 
distribution 1,000 times (with replacement), and the means were compared. All 
significant differences in mini amplitude and frequency were found to be sig- 
nificant by Monte Carlo test, KS test, and Mann-Whitney U-test of cell means. 
Since the Mann-Whitney test was found to be the most stringent test, the P values 
from Mann-Whitney tests are reported. All data was analysed blind to genotype 
or experimental condition. In all conditions, series resistance, holding potential, 
cell capacitance, and input resistance were recorded and were not found to be 
significantly different except where noted. Statistical tests were performed using 
Graphpad Prism and MATLAB. 

Sholl analysis. VIP neurons were filled with a patch pipette containing 1% Alexa 
647 Hydrazide and the internal solution was allowed to dialyze for at least 30 min 
before slices were fixed in 4% paraformaldehyde for 1h at room temperature. 
Slices were then washed three times for 30 min in PBS before slices were mounted 
in Fluormount-G (Southern Biotech). Images were acquired using a Zeiss Axio 
Imager microscope with a 20x objective with the use of an Apotome (Zeiss). 
Neurons were reconstructed using NeuronJ (ImageJ), and Sholl analysis was 
performed using a custom script in MATLAB. 

Monocular deprivation (MD) procedure. Eyelids were trimmed and sutured 
under isoflurane anaesthesia (1-2% in O2) as previously described*". The integrity 
of the suture was checked daily and mice were used only if the eyelids remained 
closed throughout the duration of the deprivation period. One eye was closed for 
4 days starting between P26 to P28. The eyelids were reopened immediately before 
recording, and the pupil was checked for clarity. 

Mouse visual-evoked potential (VEP). VEPs were recorded from anaesthetized 
mice (50mg kg~! Nembutal and 0.12 mg chlorprothixene) using standard tech- 
niques described previously’. The contra- and the ipsilateral eye of the mouse were 
presented with horizontal black and white sinusoidal bars that alternated contrast 
(100%) at 2 Hz. A tungsten electrode was inserted into the binocular visual cortex 
at 2.8mm from the midline where the visual receptive field was approximately 
20° from the vertical meridian. VEPs were recorded by filtering the signal from 
0.1-100 Hz and amplifying 10,000 times. VEPs were measured at the cortical depth 
where the largest amplitude signal was obtained in response to a 0.05 c.p.d. stimulus 
(400-600 jm); 3-4 repetitions of 20 trials each were averaged in synchrony with 
the abrupt contrast reversal. The signal was baseline corrected to the mean voltage 
of the first 50 ms post-stimulus-onset. VEP amplitude was calculated by finding 
the minimum voltage (negative peak) within a 50-150 ms post-stimulus-onset 
time window. Acuity was calculated only from the deprived eye. For each different 
spatial frequency, 3-4 repetitions of 20 trials each were averaged in synchrony 
with the abrupt contrast reversal. VEP amplitude was plotted against the log of 
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the different spatial frequency, and the threshold of visual acuity was determined 
by linear extrapolation to 0;1V. 

Animal husbandry and colony management. [gf] conditional knockout mice’, 
Ai9 tdTomato reporter mice**, Emx1-cre*4, Pv-cre**, Gad2-cre, Sst-cre, Vip-cre** 
and RiboTag mice® are available from The Jackson Laboratory. 

For routine experimentation, animals were genotyped using a PCR-based strat- 
egy; PCR primer sequences are available at the The Jackson Laboratory’s website. 
For RiboTag experiments, mice homozygous for the RiboTag allele were crossed 
to mice homozygous for the cre allele and all experiments were performed with 
mice double heterozygous for both the RiboTag and the cre alleles. For Igfl cKO 
experiments, mice heterozygous for the Igf1 conditional allele (Igf”“") and 
homozygous for the Vip-cre allele were crossed to mice heterozygous for the Igfl 
conditional allele and homozygous for the tdTomato reporter allele. Resulting 
littermates all had one copy of the Vip-cre transgene and the tdTomato Cre reporter 
and yielded Igf1"/"" and IgfJ™" littermates for experimentation. For injections 
of AAV constructs in the visual cortices of Cre mice (Vip-, Pv-, Sst-, or Emx1-cre), 
mice homozygous for the cre allele were crossed to wild-type C57Bl6 mice and 
offspring heterozygous for the cre allele were used for experiments. The use of 
animals was approved by the Animal Care and Use Committee of Harvard Medical 
School and/or the University of California Berkeley. 
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Extended Data Figure 1 | Validation of the sensory stimulation protocol 
and the RiboTag-based cell-type-specific purification of mRNA. 

a, Quantitative real-time PCR (qPCR) for known experience-regulated 
genes on RNA isolated from the visual cortex of mice that were dark- 
housed for varying durations (overnight, 3 days, 7 days or 14 days) and 
then either euthanized in the dark or exposed to light for 1, 3 or 7.5h, and 
then euthanized. Data are normalized to the maximal value in each data 
set and represent the mean and standard error of four biological replicates. 
b, qPCR for known experience-regulated genes on RNA isolated from the 
visual cortex of mice that were dark-housed for 14 days and then either 
exposed to light for 1, 3 or 7.5h (dark + light, black) or kept in the dark 


during these hours (dark + dark, red). All mice of a given time point 

were dissected in very close temporal proximity. Data are normalized to 
the maximal value in each data set and represent the mean and standard 
error of four biological replicates. c, qPCR for known cell-type-specific 
marker genes on RNA isolated from RiboTag mice expressing Cre in 
distinct neuronal subtypes. Data are normalized to the maximal value in 
each data set and represent the mean and standard error of three biological 
replicates. d, qPCR for known early-induced transcription factors on RNA 
isolated from RiboTag mice expressing Cre in distinct neuronal subtypes. 
Data are normalized to the maximal value in each data set and represent 
the mean and standard error of three biological replicates. 
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Biological Replicate 1 
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1/3/7.5h of light exposure after dark housing, respectively). b, Line plots of three biological replicates. 
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Extended Data Figure 3 | See next page for caption. 
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Extended Data Figure 3 | Characterization of the experience-induced 
gene programs in subtypes of cortical neurons. a, Average matrix of 
Spearman correlation coefficients computed from the expression values of 
1000 random sets of 602 genes (including experience-regulated genes, with 
replacement). b, Matrix of the Spearman correlation coefficients computed 
from the expression levels of control transcripts that match the expression 
distribution of experience-regulated genes (n = 602). c, Cladogram 
resulting from hierarchical clustering of all samples (except samples from 
Gad2-cre). Cladograms were computed using the mean expression values 
(that is, normalized reads across all exons of a gene) for all expressed 
transcripts (n = 14,280). d, Cladogram resulting from hierarchical 
clustering of the mean expression values of a set of control transcripts that 
match the expression distribution of experience-regulated genes (n = 602). 
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e, Pie charts showing the subdivision of experience-regulated genes on 

the basis of kinetics in each set of Cre lines (red, rapidly induced; grey, 
induced with delayed kinetics; orange, induced only after two weeks of 
dark housing; green, rapidly suppressed; magenta, suppressed with delayed 
kinetics; blue, suppressed only after two weeks of dark housing). f, Left, 
matrix of Spearman correlation coefficients between Cre lines computed 
using the mean expression values (normalized reads across all exons of a 
gene) of early-induced genes one hour after light exposure. Right, matrix 
of Spearman correlation coefficients between Cre lines computed using 
the mean expression values of late-induced genes 7.5h after light exposure. 
For each matrix, the correlations upon permuting the expression values 
are also shown (colour scale at right, scale begins at zero.) 
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Extended Data Figure 4 | Characterization of cell-type-specific and 
experience-induced genes in subtypes of cortical neurons. a, Scatter 
plots showing the log; expression values for each expressed gene ina 
given Cre line (x axis) plotted against the maximum logjo expression 
values for that gene found in all other Cre lines (y axis). Black line denotes 
unity, and the red line is the 5.5-fold enrichment threshold set to include 
Vglutl as a cell-type-specific gene in Emx1-cre. Data represent the mean 
values of three biological replicates. b, Scatter plots of all expressed genes, 
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for each Cre line plotting the mean log; fold enrichment in that Cre line 
(x axis) against the mean log, of the absolute value of the maximum fold 
change observed in that Cre line. Data represent the mean values of three 
biological replicates. Genes that pass both enrichment and induction 
thresholds in 3 of 3 biological replicates are shown in red. c, Bar graph 
showing the maximum expression value (in normalized reads) for 
VIP-neuron-specific experience-regulated genes. 
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neurons. a, Example image of cortices from Igf1 wild-type (WT) (b), number of VIP neurons per image per layer (c), or soma size of VIP 
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Extended Data Figure 6 | See next page for caption. 
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Extended Data Figure 6 | Igfl knockdown in VIP neurons affects 
inhibitory inputs onto VIP neurons but not onto neighbouring 
neurons. a, AAV shRNA constructs. shRNA cassettes against Igf1 or a 
control gene (Luc) were cloned downstream of the U6 promoter into 

an AAV vector that drives Cre-dependent expression of eGFP. b, qPCR 
validation of the efficacy of Igfl shRNA constructs. Cultured cortical 
neurons were infected with lentiviral constructs either expressing no 
shRNA (vector only), a control shRNA (against Luc) or shRNAs against 
Igfl. Four days post-infection the cultures were quieted overnight with 
TTX and AP-5 and then harvested either before or after being depolarized 
with 55 mM KCI for 6h; RNA was then isolated and qPCR was performed. 
Data are normalized to the maximal value in each replicate and represent 
the mean and standard error of three biological replicates. c, Bar graph 
showing normalized soma size of P21 visual cortex VIP neurons infected 
with control shRNA or shRNA targeting Igf1 (shRNA control, n = 103; 
shRNA Igf1, n= 174; P=0.41, Mann-Whitney U-test). d, Bar graphs 
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showing the levels of IGF1 in the serum of P20 mice that were injected 
intracortically with AAVs driving the expression of control shRNA (black) 
or Igfl shRNA. Data represent the mean and s.e.m. of the serum IGF1 
levels of four mice per group. e-l, Bar graphs and cumulative distribution 
plots showing mIPSC amplitudes and frequency/inter-event interval upon 
early widespread knockdown of Igf1 in VIP (e, f), SST (g, h), PV (i, j) and 
excitatory (k, 1) neurons after injection of AAVs into P3 cortices of the 
respective Cre mice. VIP neurons (identified as eGFP-positive cells in 
Vip-cre mice): control and Igfl shRNA, amplitude P= 0.96, frequency 
P=0.04. SST neurons (identified as eGFP-positive cells in Sst-cre mice): 
control and Igfl shRNA, amplitude P= 0.89, frequency P= 0.55. PV 
neurons (identified as eGFP-positive cells in Pv-cre mice): control and 
Igfl shRNA, amplitude P = 0.084, frequency P= 0.93. Pyramidal neurons 
(identified by morphology): control and Igf1 shRNA, amplitude P = 0.84, 
frequency P= 0.15). All P values are derived from Mann-Whitney U-tests; 
numbers inside bars indicate the number of cells recorded. 
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Extended Data Figure 7 | See next page for caption. 
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Extended Data Figure 7 | Effects of IGF1 overexpression in excitatory 
and SST-positive neurons. a, RiboTag-seq identifies Igf1.4 as the 

major Igfl isoform expressed in VIP neurons. Representative tracks of 
histograms of the RNA-seq reads in each Cre line across the Igfl genomic 
locus. Data are from the 7.5h light exposure RiboTag-seq data sets. 

b, AAV constructs for the Cre-dependent expression of HA-/Myc-tagged 
IGF1 (Igf1.4) and eGFP (that is, IGF1-OE, top) or of eGFP alone (that is, 
control, bottom). F2A, Furin cleavage site followed by the 2A peptide; 
black and white triangles represent a Cre-dependent Flex-switch. 

c, Western blot analysis of IGF1-receptor activation status in lysates of 
serum starved HEK293T cells that were stimulated with conditioned 
media (CM) containing epitope-tagged isoforms of IGF1. CM was 
produced by transfecting HEK293T cells with the respective construct and 
collecting the culture media. IGF1-receptor is detected with antibodies 
against either activated IGF1 receptor (anti-IGF1R pTyr!!"') or total 
IGF1 receptor (anti-IGF1R total). Molecular weight markers are on the 
right and the arrow indicates the band of the IGF1 receptor. d, e, Bar 
graphs and cumulative distribution plots showing mIPSC frequency/ 


LETTER 


inter-event interval (d) and amplitude (e) of mIPSCs recorded from eGFP- 
positive neurons in P20 Sst-cre mice that were intracortically injected with 
AAVs driving the expression of control (C) or IGF1-OE (OE) constructs 
(amplitude, P= 0.16; frequency, P= 0.01; Mann-Whitney U-test; numbers 
inside bars indicate the number of cells recorded). f, g, Bar graphs and 
cumulative distribution plots showing mIPSC frequency/inter-event 
interval (f) and amplitude (g) of mIPSCs recorded from eGFP-positive 
neurons in P20 Emx1-cre mice that were intracortically injected with 
AAVs driving the expression of control (black, n = 20) or IGF1-OE AAVs 
(red, n= 20). Amplitude, P= 0.99; frequency, P=0.01, Mann-Whitney 
U-test. h-j, Scatter plots of IGF 1-interacting proteins showing the logio 
normalized mean expression values in Vip-cre neurons versus each of 

the other Cre lines (Emx1 (h), Pv (i), Sst (j)). k, Quantification of EPSC 
paired-pulse recordings from VIP neurons infected with control shRNA- 
(black n= 9) or Igfl shRNA- (red n= 11) expressing AAVs. The ratio of 
the second EPSC amplitude divided by the first EPSC amplitude is plotted 
against inter-stimulus interval (P=0.1, two-way ANOVA). 
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Extended Data Figure 8 | VIP neuron-derived IGF1 does not disrupt 
ocular dominance plasticity. a, Widespread infection of VIP neurons 

by AAV-shRNA-hUbc-Flex-eGFP. High-titre injection of AAVs into the 
visual cortex of P18-20 Vip-cre/+ mice leads to infection of the majority 
VIP neurons (green, eGFP; red, anti- VIP; blue, DAPI; arrowheads, 
infected VIP neurons; scale bars, 150m, 20m in the inlet). b, c, Average 
of VEP traces recorded in the visual cortices of mice that were injected 
with AAVs expressing control shRNA (black/grey) or shRNA against 

Igf1 (red/pink) shRNA and that were (grey, pink) or were not (black, 

red) subjected to monocular deprivation in the eye contralateral to the 
recording site (MD versus NoMD, respectively). d, Monocular deprivation 


induces a significant reduction in the VEP amplitude in response to low 
spatial frequency stimulation in mice that had AAVs expressing control 
shRNA and Igf1 shRNA injected into their visual cortices (control shRNA 
NoMD, n=5 mice; control shRNA MD, n=7 mice; Igfl shRNA NoMD, 
n=5; Igfl shRNA MD, n=8. *P < 0.05, Mann-Whitney U-test). e, Mice 
that had AAVs expressing control shRNA (black) and Igf1 shRNA (red) 
injected into their visual cortices display normal ocular dominance 
plasticity as monocular deprivation (MD) induces a shift to the ispilateral 
eye in both groups (control shRNA NoMD, n=5 mice; control shRNA 
MD, n=7; Igfl shRNA NoMD, n=5 ; Igfl shRNA MD, n=8 mice. 

** P< 0.0001, one-way ANOVA with Tukey’s post hoc test). 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


doi:10.1038/nature17000 


Co-ordinated ocular development from human 
iPS cells and recovery of corneal function 


Ryuhei Hayashi!*, Yuki Ishikawa’, Yuzuru Sasamoto’, Ryosuke Katori?, Naoki Nomura’, Tatsuya Ichikawa’, Saori Araki’, 
Takeshi Soma’, Satoshi Kawasaki*, Kiyotoshi Sekiguchi’, Andrew J. Quantock‘, Motokazu Tsujikawa? & Kohji Nishida? 


The eye is a complex organ with highly specialized constituent 
tissues derived from different primordial cell lineages. The retina, 
for example, develops from neuroectoderm via the optic vesicle, 
the corneal epithelium is descended from surface ectoderm, while 
the iris and collagen-rich stroma of the cornea have a neural crest 
origin. Recent work with pluripotent stem cells in culture has 
revealed a previously under-appreciated level of intrinsic cellular 
self-organization, with a focus on the retina and retinal cells!~>. 
Moreover, we and others have demonstrated the in vitro induction 
of a corneal epithelial cell phenotype from pluripotent stem cells’. 
These studies, however, have a single, tissue-specific focus and 
fail to reflect the complexity of whole eye development. Here we 
demonstrate the generation from human induced pluripotent stem 
cells of a self-formed ectodermal autonomous multi-zone (SEAM) 
of ocular cells. In some respects the concentric SEAM mimics 
whole-eye development because cell location within different zones 
is indicative of lineage, spanning the ocular surface ectoderm, lens, 
neuro-retina, and retinal pigment epithelium. It thus represents 
a promising resource for new and ongoing studies of ocular 
morphogenesis. The approach also has translational potential 
and to illustrate this we show that cells isolated from the ocular 
surface ectodermal zone of the SEAM can be sorted and expanded 
ex vivo to form a corneal epithelium that recovers function in an 
experimentally induced animal model of corneal blindness. 

To generate a SEAM of ocular cells, human induced pluripotent 
stem (iPS) cells were cultivated in differentiation medium in which 
they spontaneously and progressively formed a primordium compris- 
ing four identifiable concentric zones (Fig. 1a, Extended Data Fig. 1a). 
Cell morphology in each zone was distinctive, creating a visible delin- 
eation between zones (Extended Data Fig. 1b). The innermost central 
area (zone 1) formed first and this was followed by the emergence 
of three more radially distant concentric cell populations; zones 2-4. 
(Fig. 1b, Supplementary Video). In our experiments 7.7 + 1.8% 
of human iPS cells formed colonies and 67.9 + 4.9% of these 
resulted in the generation of a SEAM (n=5 technical replicates). 
Immunolabelling for the neural cell marker class II 8-tubulin 
(TUBB3) was positive in zones 1 and 2, but not, more peripherally, 
in zones 3 or 4 (Fig. 1c). Cells in zones 1-3 expressed the ocular cell 
marker PAX6, while those in zones 3 and 4 were positive for the 
epithelial/surface ectodermal markers p63 and E-cadherin (Fig. 1d, e, f). 
Thus, in a number of respects SEAM formation in two-dimensions 
mirrors whole eye development from the front of the ocular surface 
posteriorly to the retina (Fig. 1g). 

Cells in zone 1 expressed neural cell-specific markers TUBB3, SOX2 
and SOX6, but no surface ectodermal markers (Fig. 2a). Accordingly, 
this central area was deemed to represent presumptive neuroectoderm. 
Zone 2 cells primarily expressed the optic vesicle marker RAX and 
the neural crest marker SOX10 (Fig. 2a). In eye development various 


cell types assume a tissue-specific arrangement at the juxtaposition of 
the embryonic anterior and posterior eye segments (Extended Data 
Fig. 1c), and perhaps this is reflected here. We also note that zone 2 
frequently contained retinal pigment epithelium (RPE)-like cells, and 
that SOX10*/p75* neural crest cells were found to have emerged in 
satellite spheres in the presumptive zone 2 after two weeks in culture 
(Fig. 2b, Extended Data Fig. 1d). Differentiation of SEAMs by a pro- 
tocol which encourages retinal differentiation further demonstrated 
that CHX10* neuro-retinal cells and MITF* RPE cells were present 
in zone 2 towards its inner and outer margins, respectively (Fig. 2c, d). 
Collectively, the findings imply that zone 2 of the ocular SEAM is 
a developmental analogue of neural eye tissues comprising the 
neuro-retina, neural crest and RPE. 

At the margin of zones 2 and 3 a-crystallin™ lens cell clusters 
emerged after four weeks in culture and spread further through the 
SEAM by week six (Fig. 2e, Extended Data Fig. le). Cells in zone 3 
did not display any neural features and this region was unique with its 
PAX6/p63-double-positive phenotype, representative of ocular surface 
ectoderm. Zone 3 cells also specifically expressed ocular surface ecto- 
derm markers such as PAX6, deltaN (DN)-p63, K18, and E-cadherin, 
but no neural cell markers (Fig. 2a). Thus, cells in zone 3 are consid- 
ered to be anlages of the ocular surface epithelium. Cells in zone 4 
expressed epithelial genes DN-p63 and E-cadherin, and did not express 
PAX6. This points to their identity as general surface ectodermal cells, 
which will probably differentiate into epidermal keratinocytes. The 
interactions of the different cell lineages complicit in whole eye for- 
mation in situ are thus mimicked in the research described here as 
illustrated schematically in Figs 1g and 2f. 

As mentioned, cells with retina-like characteristics have been 
generated from human iPS cells!~°, but functional ocular surface tissue 
has not. Thus, we sought to form a transplantable corneal epithelium; 
(i) as a conceptual example of the translational potential of the SEAM 
and (ii) to demonstrate that functional anterior eye tissue can indeed 
be fashioned from human iPS cells (Fig. 3a). At two and four weeks in 
culture cells in zone 3 co-expressed PAX6 and p63, first partially then 
fully. After isolation by the manual pipetting of cells in other zones 
(Extended Data Fig. 2a, b), zone 3 cells were also found to express K14 
and cornea-specific keratin K12'° by 8-12 weeks (Fig. 3b). This was 
confirmed by qRT-PCR analysis (Extended Data Fig. 2c). Vimentin 
positive (VIM*) stroma-like cells were also present in the p63* zone 3 
at eight weeks (Extended Data Fig. 2d). Cellular differentiation in 
zone 3 of the SEAM thus follows that seen in the embryonic (mouse) 
eye (Extended Data Fig. 2e). BMP signalling influences the develop- 
ment of the surface ectoderm!!”, and we found that zone 3 of the 
SEAM was abolished following its treatment with the BMP4 inhib- 
itors Noggin and LDN-193189 (Extended Data Fig. 3). The TGF3 
inhibitor, SB-431542, also disturbed the typical multi-zone forma- 
tion (Extended Data Fig. 3), suggesting that the suppression of ocular 
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Figure 1 | Differentiation of multiple ocular cells in the SEAM. 

a, A typical SEAM of differentiated human iPS cells after 40 days of 
culture. Representative of 19 independent experiments. Scale bar, 200 jim. 
b, Time-lapse microscopy of the differentiating human iPS cells during the 
first 25 days (d) of culture. Representative of six independent experiments. 
Scale bar, 200 1m. See also Supplementary Video 1. c-f, Immunostaining 
of TUBB3 (green) and p63 (red) (c, zones 1-3), PAX6 (green) and p63 
(red) (d, zones 1-3), E-cadherin (Ecad, green) and p63 (red) (e, zones 
1-3), and PAX6 (green) and p63 (red) (f, zones 2-4) in the SEAM after 

six to seven weeks of culture. Images are representative of three or four 
independent experiments. Nuclei, blue. Scale bar, 100 1m. g, The SEAM 
of human iPS cells induced different kinds of cells of ectodermal lineage, 
mimicking anterior and posterior eye development in vivo. CNS, central 
nervous system; NE, neuroectoderm; OC, optic cup; NR, neuroretina; NC, 
neural crest; LE; lens; OSE, ocular surface ectoderm; SE, surface ectoderm; 
CE, corneal epithelium; EK, epidermal keratinocyte. 


surface ectodermal commitment is caused by the inhibition of early 
developmental events induced by endogenous BMP/TGEF8. 

As alluded to above, cells in SEAM zone 3 most closely resemble 
those of the presumptive ocular surface, and these were investigated 
for their translational, regenerative potential. After removal of zones 
1 and 2 from the SEAM by manual pipetting, the cells that remained 
(that is, those in zone 3 and some in zone 4) were subjected to FACS 
to isolate a specific ocular surface lineage—corneal or conjunctival— 
and the results of this are shown in Extended Data Fig. 4a. SSEA-4 is a 
common marker of pluripotent stem cells and is specifically expressed 
in corneal epithelial cells in vivo’, including stem/progenitor cells 
(Extended Data Fig. 4b). Here, the use of SSEA-4 and the basal epithe- 
lial marker ITGB4 revealed that 14.1% of cells were SSEA-4*/ITGB4* 
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Figure 2 | Characterization of cell zones in the SEAM. a, Gene 
expression analysis for ectoderm-related markers in each zone of the 
SEAM (zones 1 and 2, n= 12 colonies; zones 3 and 4, n =7 colonies 
obtained from two independent experiments, respectively). Error bars 
are s.d. b, Phase-contrast image and SOX10 (red) expression around 
SEAM zones 1 and 2 after two weeks of culture (representative of three 
independent experiments). Nuclei, blue. Scale bar, 200 um. ¢, Bright- 
field (BF) appearance of neuroretinal cells (CHX10*, red) and RPE 

cells (MITF*, green) in zone 2 after seven weeks of culture using the 
protocol for retinal differentiation (representative of three independent 
experiments). Nuclei, blue. Scale bar, 200 1m. d, Cultivation of the isolated 
RPE cells (representative of three independent experiments). Scale bars, 
200 1m (upper panel) and 501m (lower panel). e, Immunostaining of 
lens differentiation marker, a-crystallin (green) and p63 (red), after four 
weeks of culture (representative of three independent experiments). 
Nuclei, blue. Scale bar, 100 1m. f, Schematic of the expression pattern of 
ocular-cell-related genes in the SEAM. 


(P3 cells), whereas 16.6% were SSEA-4/ITGB4* (P2 cells) (Fig. 3c). 
Thus, the population contains both corneal and non-corneal epithelial 
cells, with the indication that most colony-forming cells are derived 
from P2 and P3 fractions (Fig. 3d). Unlike P2, however, a significant 
proportion of the P3 colonies expressed PAX6 (Extended Data Fig. 4c). 
P3 cells also had higher expression levels of corneal epithelial- 
specific markers PAX6, K12, CLU, and ALDH3A1", but lower levels 
of the epidermal marker K10 and the mucosal epithelial marker K13 
(Extended Data Fig. 4d). 

P3 cell-derived colonies mostly consisted of PAX6*/K12* corneal 
epithelial colonies along with a lower number of PAX6+/K12!" limbal 
epithelial colonies, which are assumed to represent corneal epithelial 
stem/progenitor cells (Fig. 3e, Extended Data Fig. 5a). The limbus, at 
the edge of the cornea, is where corneal epithelial stem/progenitor 
cells are widely believed to reside. Stratified epithelia derived from 
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Figure 3 | Induction and isolation of ocular surface epithelial cells 
from the SEAM. a, Schematic representation of the strategy used for the 
generation of the ocular surface ectoderm and epithelium. CDM, corneal 
differentiation medium; CEM, corneal epithelium maintenance medium; 
CMM, corneal epithelium maturation medium; DM, differentiation 
medium; w, week(s). b, Immunostaining for ocular surface and corneal 
epithelial development-related markers, PAX6 (green), p63 (red), K14 
(green), and K12 (red) during ocular surface epithelial differentiation 
culture (1-12 weeks, representative of three independent experiments). 
Nuclei, blue. Scale bar, 100 1m. ¢, A flow cytometric analysis of SSEA-4, 
ITGB4 and TRA-1-60 in the differentiated human iPS cells after 

10-15 weeks of culture revealed no undifferentiated human iPS cells 
(SSEA-4*/TRA-1-60* cells) (left panels). TRA-1-60~ cells were analysed 
by SSEA-4 and ITGB4 (right panel). The four populations are defined as 


P3 cells expressed K12 and PAX6, but not K13 (Fig. 3f) or HOX genes 
(Extended Data Fig. 5b). P2 cells, on the other hand, expressed HOX 
genes (Extended Data Fig. 5b), which are not expressed in cells of the 
ocular surface!*!®, In the colonies of P2 cells, those expressing HOXB4 
and PAX6 existed exclusively of each other (Extended Data Fig. 5c). 
Immunostaining of P2 cells further demonstrated that they consisted 
of PAX6*/K13* conjunctival epithelial cells!” accompanied by a major- 
ity of PAX6/K13* non-ocular epithelial cells (Fig. 3e, f, Extended 
Data Fig. 5a). Long-term differentiation revealed that PAS-stained, 
MUCS5AC*, K7* conjunctival goblet-like cells appeared in presump- 
tive P2 cell regions (Extended Data Fig. 5d). Collectively, our data 
indicates that cells in zone 3 of the SEAM have characteristics of cor- 
neal, limbal, and conjunctival epithelial cells (Supplementary Table 1), 
raising the possibility that functional ocular surface epithelia could be 
developed for surgical use. 
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P1-P4. The data shown here are representative images of 23 independent 
cell-sorting experiments. d, Colony-forming assay for the sorted P1-P4 
cell fractions. Fixed colonies were stained with rhodamine (right upper 
panel, representative of seven independent cell-sorting experiments) and 
the colony-forming efficiency were calculated (graph: seven independent 
cell-sorting experiments). Error bars are s.d. Scale bar, 5mm. e, The 
ratios of corneal epithelial (PAX6*/K12* or PAX6*/K12!”), conjunctival 
epithelial (PAX6*/K13*/K127) and other epithelial cell types in the P2 
and P3 colonies from five independent cell sorting experiments. Error 
bars are s.d. f, Triple-colour immunostaining for PAX6 (green), K12 
(orange), K13(magenta), and p63 (green) expression in stratified P2 

and P3 cells and cultivated human corneal limbal epithelial cells (CECs) 
(representative of three independent experiments, respectively). Nuclei, 
blue. Scale bar, 50,1m. 


To investigate this prospect we generated ex vivo expanded sheets 
of corneal epithelium from FACS-sorted cells acquired from zone 3 
of the SEAM (Fig. 4a). The sheets expressed the corneal limbal stem- 
cell markers K15 and K19, along with the mucins MUC1, MUC4 
and MUC16, tight junction protein ZO-1, the differentiation marker 
CX43, K3 and K12, all of which are characteristic for the cornea 
(Fig. 4b, Extended Data Fig. 6a). The non-corneal keratins K10 and 
K18 were not expressed (Fig. 4b). Approximately 99% of the cells in 
the expanded sheets were stratified K14* epithelial cells, 95% were 
of corneal epithelial lineage (SSEA-4*), and 70% were differentiated 
corneal epithelial cells (K12*) (Fig. 4c). Cells typically had a smooth 
apical surface with microvilli-like structures (Extended Data Fig. 6b). 
Analyses based on significantly changed genes revealed that cells 
in the expanded corneal epithelial sheet differed from human oral 
mucosal keratinocytes, dermal fibroblasts and, indeed, human iPS 
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Figure 4 | Characterization and surgical use of human iPS 

cell-derived corneal epithelial cells (iCECs). a, Phase-contrast 
microscopy and haematoxylin and eosin (H&E) staining of the human 
iCECs on cell culture inserts (representative of four independent 
experiments, respectively). Scale bars, 100 1m (phase), 501m (H&E). 

b, Immunostaining for corneal epithelial functional proteins and 
epithelial markers (green) in the stratified SEAM-derived human iCEC 
sheets (representative of three independent experiments). Nuclei, red. 
Scale bar, 501m. ¢, Results of flow cytometric analyses for K14, K12 and 
SSEA-4 expression in the stratified human iCECs (representative of three 
independent experiments). d, Results of a principle component analysis 
based on the global gene expression (examined by microarrays) comparing 
human iPS cells (n = 3 technical replicates), ocular surface ectoderm 
(OSE; that is, human iPS cell-derived cells after six weeks of differentiation, 
n=3 technical replicates), human oral keratinocytes (OKs, n =3 technical 
replicates), human iCECs (n= 4 independent experiments), human 
dermal fibroblasts (DFs, n = 3 technical replicates) and human corneal 
epithelial cells from the limbus (CECs, n = 3 independent experiments). 

A total of 25,262 significantly changed genes (fold change >2.0, fasle 


cells, but were similar to cells harvested from the epithelium at the 
limbus of donated research human corneas (Fig. 4d, Extended Data 
Fig. 6c). Cells had the proliferative capability to be passaged 11 to 
16 times (average population doubling, 35.1; Fig. 4e, Extended Data 
Fig. 6d) without any obvious karyotype aberration at each passage 
(Extended Data Fig. 6e). Holoclone analysis indicated that 15.9% of 
the corneal epithelial colony was composed of holoclones (Fig. 4f). 
Together, the data show that corneal epithelial cells derived from 
our SEAM contain functional stem/progenitor cells equipped with 
high proliferative potential, which have the potential to reconstruct 
anterior eye epithelial tissue. Furthermore, all five different human 
iPS cell clones used were able to successfully induce corneal epithe- 
lial stem/progenitor cells, though each had a differential propensity 
for engendering corneal epithelial differentiation (Extended Data 
Fig. 7). 

To investigate the translational potential of this approach, 
ex vivo expanded corneal epithelial cell sheets were cultivated as 


Days after transplantation 


discovery rate <0.05) were analysed. Compared to human DFs, human 
iPS cells, human OKs or the OSE, the gene expression in human iCECs 
was most similar to that of human CECs. e, Proliferation profiles of the 
human iCECs during serial passages (n = 3 independent experiments, 
average population doubling, 35.1). f, Result of a holoclone analysis for 
human iCEC colonies. Representative images of holoclone-, meroclone- 
and paraclone-derived colonies and their frequencies are shown (n= 63 
single colonies from four independent cell-sorting experiments). Scale 
bar, 10mm. g, Barrier function assay using fluorescein staining for 
transplanted and sham-operated control corneas on days 0, 7 and 14 
post-surgery (left panels). A statistical analysis of the barrier function 
interpreted as the size of the fluorescein-negative area in the human 
iCEC-sheet-transplanted and control corneas (graph). *P < 0.05; n=7 
biological replicates, Steel’s test (Bonferroni corrected). Error bars are s.d. 
h, Immunostaining for human nuclei and corneal epithelial markers 
(green) in the SEAM-derived human iCEC-sheet-transplanted corneas on 
postoperative day 14 (n =6 animal transplantation experiments). Nuclei, 
red. Scale bar, 50 um. 


recoverable and translatable constructs, in which approximately 
1.0% of cells maintained a colony-forming capability (Extended Data 
Fig. 8a, b). When the sheets were transplanted onto the eyes of 
rabbits in an experimental model of corneal epithelial stem-cell 
deficiency (Extended Data Fig. 8c-j), they successfully recovered 
a healthy corneal barrier function (Fig. 4g, Extended Data Fig. 8k) 
and continued to express cornea-specific proteins (Fig. 4h). This 
advance—that is, the generation of functional SEAM-derived ocular 
surface tissue with stem/progenitor cells, which can surgically repair 
the front of the eye—is noteworthy because, although somatic stem 
cells have been used to recover the ocular surface, the long-term 
clinical results have not been overly encouraging'**°. In resolving 
key purification steps for corneal epithelial stem/progenitor cells 
by applying a combination of specific antibodies to our human iPS 
cell-derived ocular SEAM, we are now in the position to initiate first- 
in-human clinical trials of anterior eye transplantation to restore 
visual function. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and investigators were not blinded to allocation during 
experiments and outcome assessment. 

Human iPS cell culture. The human iPS cell lines 201B7, 253G1, and 454E2 were 
obtained from the RIKEN Bio Resource Center (Tsukuba, Japan)”!””. The 1231A3 
and 1383D2 human iPS cells were provided by the Center for iPS Cell Research 
and Application, Kyoto University”. All cells were cultured in StemFit medium 
(Ajinomoto, Tokyo, Japan) on LN511E8-coated (0.5}1gcm 7) dishes?**4, LN511E8, 
produced using cGMP-banked CHO-S cells (Life Technologies, Carlsbad, CA), was 
obtained from Nippi (Tokyo, Japan). In part, LN511E8 was produced using human 
293-F cells as previously described”, The 201B7 and 454E2 human iPS cell lines 
were used in the in vitro experiments, while 201B7 and 1383D2 cells were used in 
the animal experiments; 253G1 and 1231A3 cells were used in the supplementary 
experiments, the results of which are reported in Extended Data Fig. 7. All of the 
experiments using recombinant DNA were approved by the Recombinant DNA 
Committees of Osaka University and were performed according to our institu- 
tional guidelines. 

Ocular cell differentiation from human iPS cells. The differentiation culture for 
human iPS cells was performed as indicated in Fig. 3a. First, human iPS cells were 
seeded on LN511E8-coated dishes at 350-700 cells cm~?, after which they were cul- 
tivated in StemFit medium for 8-12 days. The culture medium was then changed 
to DM (differentiation medium; GMEM (Life Technologies) supplemented with 
10% knockout serum replacement (KSR; Life Technologies), 1 mM sodium pyru- 
vate (Life Technologies), 0.1 mM non-essential amino acids (Life Technologies), 
2 mM t-glutamine (Life Technologies), 1% penicillin-streptomycin solution 
(Life Technologies) and 55\1M 2-mercaptoethanol (Life Technologies) or mon- 
othioglycerol (Wako, Osaka, Japan))’°. In some experiments, as indicated in the 
Results section, Noggin (R&D systems, Minneapolis, MN), LDN-193189 (Wako) 
or SB-431542 (Wako) were added for the first four days. BMP4 (R&D systems) 
was used in some early experiments at concentrations up to 0.125nM. This had no 
discernible effect on SEAM formation, however, so its use was discontinued. After 
four weeks of culture in DM, the medium was changed to corneal differentiation 
medium (CDM; DM and Cnt-20 or Cnt-PR (w/o; EGF and FGF2) (1:1, CELLnTEC 
Advanced Cell Systems, Bern, Switzerland) containing 5 ng ml—! FGE2 (Wako), 
20ng ml-! KGF (Wako) 104.M Y-27632 (Wako) and 1% penicillin-streptomycin 
solution). FGF2 in CDM was not essential for corneal epithelial induction. During 
CDM culture (around six to eight weeks of differentiation), non-epithelial cells 
were removed by manual pipetting under microscopy (Extended Data Fig. 2a, b). 
After pipetting, the medium was changed to fresh CDM. After four weeks of culture 
in CDM, the medium was changed to corneal epithelium maintenance medium 
(CEM; DMEM/F12 (2:1), Life Technologies) containing 2% B27 supplement (Life 
Technologies), 1% penicillin-streptomycin solution, 20ngml~! KGF and 10,.M 
Y-27632 for two to seven weeks. To achieve retinal differentiation (Fig. 2c) after 
four weeks of differentiation the medium was directly changed to CEM. Isolated 
RPE cell colonies were cultivated in CEM on separate dishes coated with LN511E8. 
Phase-contrast microscopic observations were performed with an Axio-observer. 
Z1, D1 (Carl Zeiss, Jena, Germany) and an EVOS FL Auto (Life Technologies). 
Flow cytometry and cell sorting. Differentiated human iPS cells in CEM were 
dissociated using Accutase (Life Technologies), and resuspended in ice-cold KCM 
medium (DMEM without glutamine and Nutrient Mixture F-12 Ham (3:1, Life 
Technologies) supplemented with 5% FBS (Japan Bio Serum, Hiroshima, Japan), 
0.4p1g ml! hydrocortisone succinate (Wako), 2nM 3,3’,5-Triiodo-L-thyronine 
sodium salt (MP biomedicals, Santa Ana, CA), 1 nM cholera toxin (List Biological 
Laboratory, Campbell, CA), 2.25 1g ml! bovine transferrin HOLO form (Life 
Technologies), 2mM t-glutamine, 0.5% insulin transferrin selenium solution 
(Life Technologies) and 1% penicillin-streptomycin solution). The harvested 
cells were filtered with a cell strainer (40|1m, BD Biosciences, San Diego, CA) and 
then stained with anti-SSEA-4 (MC813-70, Biolegend, San Diego, CA), TRA-1-60 
(TRA-1-60-R, Biolegend) and CD104 (ITGB4; 58XB4, Biolegend) antibodies for 
lh on ice. After being washed twice with PBS, stained cells underwent cell sorting 
with a FACSAria II instrument (BD Biosciences). For intracellular protein staining, 
a BD Cytofix/Cytoperm (BD Biosciences) kit was used. In all of the experiments, 
cells were stained with non-specific isotype IgG or IgM as controls (Biolegend). 
The data were analysed using the BD FACSDiva Software (BD Biosciences) and 
the FlowJo software program (TreeStar, San Carlos, CA). 

Fabrication and harvest of human iPS cell-derived corneal epithelial cell 
(human iCEC) Sheets. Sorted human iPS cell-derived epithelial cells obtained 
from zone 3 of the SEAM (human iCECs) were seeded on LN511E8 coated 
(0.5 gem”) cell culture inserts or temperature-responsive dishes (UpCell, 
CellSeed, Tokyo, Japan) without cell passaging, and were cultured in CEM until 
confluence”®. To promote maturation, the epithelial cells were cultivated in CMM 
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(corneal epithelium maturation medium; KCM medium containing 20 ng ml“! 
KGEF and 10\1M Y-27632) for an additional 3-14 days after CEM culture. The 
human iCECs cultivated on temperature-responsive dishes were released from 
their substrate by reducing the temperature to 20°C. 

Quantitative real-time reverse-transcriptase PCR (qRT-PCR). Total RNA was 
obtained from differentiated human iPS cells after specific culture periods, from 
human epidermal keratinocytes (EKs (foreskin), Life Technologies and TaKaRa 
Bio, Otsu, Japan), and from human corneal limbal epithelial cells (CECs) using the 
RNeasy total RNA kit or the QIAzol reagent (Qiagen, Valencia, CA). Reverse tran- 
scription was performed using the SuperScript III First-Strand Synthesis System 
for qRT-PCR (Life Technologies) according to the manufacturer’s protocol, and 
cDNA was used as a template for PCR. RT-PCR was performed using the ABI 
Prism 7500 Fast Sequence Detection System (Life Technologies) in accordance 
with the manufacturer's instructions. The TaqMan MGB used in the present study 
are shown in Supplementary Table 2. The thermocycling program was performed 
with an initial cycle at 95°C for 20s, followed by 45 cycles at 95°C for 3s and 60°C 
for 30s. 

Immunofluorescence staining. Research grade human skin tissue sections were 
obtained from US Biomax Inc. (MD, USA) and human oral mucosal tissue was 
obtained from Science Care (Phoenix, AZ). The cells were fixed in 4% paraform- 
aldehyde (PFA) or cold methanol, washed with Tris-buffered saline (TBS, TaKaRa 
Bio) three times for 10 min and incubated with TBS containing 5% donkey serum 
and 0.3% Triton X-100 for 1h to block non-specific reactions. They were then 
incubated with the antibodies shown in Supplementary Table 3 at 4°C overnight 
or at room temperature for 3h. The cells were again washed twice with TBS for 
10 min, and were incubated with a 1:200 dilution of Alexa Fluor 488-, 568-, 647- 
conjugated secondary antibodies (Life Technologies) for 1h at room temperature. 
Counterstaining was performed with Hoechst 33342 (Molecular Probes) before 
fluorescence microscopy (Axio Observer.D1, Carl Zeiss). 

Haematoxylin and eosin staining. Fabricated human iCEC sheets were fixed with 
10% formaldehyde neutral buffer solution (Nacalai Tesque, Kyoto, Japan). After 
washing with distilled water, the human iCEC sheets were embedded in paraffin 
from which 3-|1m-thick sections were cut. These were stained with haematoxylin 
and eosin following deparaffinization and hydration. The sections were observed 
with a NanoZoomer-XR C12000 (Hamamatsu Photonics, Hamamatsu, Japan), 
BZ-9000 (KEYENCE, Osaka, Japan) and an Axio Observer.D1. 

PAS staining. Differentiated human iPS cells (more than 12 weeks of differen- 
tiation) were fixed with 10% formaldehyde neutral buffer solution, after which 
PAS staining was performed with a PAS staining kit (MERCK KGaA, Darmstadt 
Germany) according to the manufacturer’s protocol. The sections were observed 
with an Axio Observer.D1. 

Colony-formation assay (CFA). Epithelial cells were seeded onto MMC-treated 
NIH-3T3 feeder layers at a density of 3,000-20,000 cells per well. These were cul- 
tivated in CMM for 7-14 days. The colonies were fixed with 10% formaldehyde 
neutral buffer solution and then stained with rhodamine B (Wako). Colony for- 
mation was then assessed using a dissecting microscope and the colony-forming 
efficiency (CFE) was calculated. For the holoclone analysis, a single human iCEC 
colony derived from the SEAM was cultivated on 3T3-J2 (provided by H. Green, 
Harvard Medical School, Boston, MA) in CMM for 7-11 days was picked up under 
a dissecting microscope and dissociated by TrypLE Select (Life Technologies). The 
dissociated human iCECs were again seeded on a MMC-treated 3T3-J2 feeder layer 
and cultivated in CMM for 10-13 days. The colonies were scored under a micro- 
scope and classified as holoclones, paraclones or meloclones based on previously 
reported methods”’. 

Microarray analysis. Human CECs were harvested from corneoscleral rims 
(Northwest Lions Eye Bank, Seattle, WA) as reported previously**. Human CECs 
and human oral keratinocytes (OKs; ScienCell, Carlsbad, CA) along with SEAM- 
derived human iCECs were cultivated on LN511E8 coated cell culture inserts in 
CEM until confluent. They were then cultivated in CMM. Human dermal fibro- 
blasts (DFs; ScienCell) were cultivated in DMEM/F12 (2:1) containing 10% FBS. 
Total RNA was obtained from human iPS cells, iCECs, CECs, OKs, DFs, and six- 
week differentiated iPS cells (that is, OSE) using the QIAzol reagent. A microarray 
analysis using Sure Print G3 human 8x60K slides (Agilent technologies, Palo Alto, 
CA) was performed at Takara Bio. The data were analysed using the GeneSpring 
GX software program (Agilent technologies). Microarray data used in this study 
are deposited in Gene Expression Omnibus under accession number GSE73971. 
Scanning electron microscopy (SEM). The cultivated epithelial cell sheets were 
fixed in 2.5% glutaraldehyde (Nacalai Tesque) at 4°C overnight. Subsequently, 
the sheets were washed in buffer, dehydrated with ethanol and tert-butyl alcohol 
(Wako), and critical point dried (JFD-320, JEOL, Tokyo, Japan). After sputter-coating 
with platinum in an auto fine coater (JFCL-1600, JEOL), the samples were 
observed by scanning electron microscopy (JSM-6510LA, JEOL) at 5kV. 
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Serial cell passaging and karyotype analysis. FACS-isolated human iCECs were 
cultivated on MMC-treated NIH-3T3 feeder layers in CMM up to 70-80% conflu- 
ence. The human iCECs were harvested using TrypLE Select following the removal 
of feeder cells by manual pipetting. The total cell numbers were counted, after 
which the cells were passaged at a 1:8 ratio onto newly prepared feeder layers. These 
were cultivated in CMM until sub-confluence was reached again. The G-band 
karyotype analysis for human iCECs was performed at Nihon Gene Research 
Laboratories (Sendai, Japan). 

Animal experiments. All animal experimentation was performed in accordance 
with the ARVO Statement for the Use of Animals in Ophthalmic and Vision 
Research, and was approved by the animal ethics committees of Osaka University. 
To examine embryonic mouse eyes, pregnant females (C57/BL6, E9.5-18.5) 
were acquired from SLC Japan (Shizuoka, Japan). For the transplantation experi- 
ments, Female New Zealand white rabbits (2.5-3.0kg (approximately 12-14 
weeks)) were obtained from Kitayama Labes (Nagano, Japan). Harvested human 
iCEC sheets were grafted onto rabbit corneas, in which a total epithelial limbal 
stem-cell deficiency had been created following a corneal and limbal lamellar 
keratectomy (Extended Data Fig. 8c-j). After surgery, 0.3% ofloxacin ointment 
(Santen Pharmaceutical, Osaka, Japan), 0.1% betamethasone phosphate eye drops 
(Shionogi Pharmaceutical, Osaka, Japan) and 0.1% sodium hyaluronate eye drops 
(Santen Pharmaceutical) were applied three to four times per day. Triamcinolone 
acetonide (8 mg; Bristol Myers Squibb, Tokyo, Japan) was also administered by sub- 
conjunctival injection. Tacrolimus (0.05 mg kg! per day, Astellas Pharma, Tokyo, 
Japan) and Mizoribine (4.0 mgkg ! per day Sawai Pharmaceutical, Osaka, Japan) 
were systemically administered using an osmotic pump (DURECT, Cupertino, 
CA). The corneal barrier function following surgery was assessed by 0.5% fluo- 
rescein eye drop instillation at day7 and day 14 after surgery and the fluorescein 
negative area was calculated using the AxioVision software program (Carl Zeiss). 
Throughout the healing period, the cornea was observed with a digital slit-lamp 
camera (SL-7F, TOPCON, Tokyo, Japan) and 3D OCT1000 MARK II (TOPCON) 
or CASIA SS-1000 (TOMEY, Nagoya, Japan) machines. If an infection was found 


or if unexpected weight loss occurred, animals were excluded from the analysis. 
The rabbits were euthanized by the intravenous administration of sodium pento- 
barbitone 14 days after transplantation, after which the eyes were immediately enu- 
cleated for the histological analyses. No blinding or randomization was conducted 
to allocate animals to each group. 
Statistical analyses. The data are expressed as means + standard deviation (s.d.). The 
statistical analyses were performed using the Mann-Whitney rank sum test or Steel's 
test. Bonferroni's correction was applied to the data in animal experiments. All of the 
statistical analyses were performed using the JMP software program (SAS institute 
Inc., Cary, NC). No statistical methods were used to predetermine sample size. 
Comprehensive technical details can be found in Protocols Exchange, http:// 
dx.doi.org/10.1038/protex.2016.009. 
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Extended Data Figure 1 | Differentiation of multiple ocular cells in the 
SEAM. a, A differentiated human iPS cell colony after 40 days of culture 
(macro photograph, representative of six independent experiments). Scale 
bar, 5 mm. b, Magnified views of each SEAM zone margin after six weeks 
in culture (phase-contrast and bright-field views, each representative of 
three independent experiments). Arrow heads indicate borders between 
each zone. Scale bars, 50j1m. c, Schematic for the development of the 
anterior eye. CE, corneal epithelium; CEnd, corneal endothelium; IS, iris 
stroma; NR, neuroretina; RPE, retinal pigment epithelium; LE, lens. 
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d, Immunostaining for p75 (green) and SOX10 (red) in SEAM zones 1 
and 2, two weeks (w) after the start of the differentiation culture 
(representative of three independent experiments). Asterisks indicate 
SOX10*/p75* neural crest cells. Nuclei, blue. Scale bar, 100m. 

e, Immunostaining of lens differentiation marker «-crystallin (green) and 
epithelial marker p63 (red) in zones 2 and 3 of the SEAM after six weeks 
of culture (representative of three independent experiments). Nuclei, blue. 
Scale bar, 100m. 
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Extended Data Figure 2 | Enrichment and differentiation of a SEAM- 
derived ocular surface epithelium. a, The enrichment of zone 3 ocular 
surface epithelium by manual pipetting. Cells in the SEAM before (upper 
panel) and after (lower panel) pipetting are shown (1 = 1). Scale bar, 200 1m. 
b, The expression of ocular-cell-related genes in removed and attached cells 
after pipetting (n= 1). c, The time course of PAX6, DN-p63, K14, K12 and 
LIN28A expression during human iPS cell differentiation culture (0, 2, 4, 6, 
8 and 12 weeks, each n=5 independent experiments). Error bars are s.d. 

d, Immunostaining for vimentin (VIM, green) and p63 (red) before and after 


L 


FACS purification conducted at 11 weeks of culture (representative of three 
independent experiments). VIM* stroma-like cells of zone 3 were removed 
by FACS. Nuclei, blue. Scale bar, 501m. e, Immunostaining for anterior 

eye development-related markers Pax6 (green), p63 (red), K14 (green) and 
K12 (red) during mouse eye development (E9.5-18.5, each representative 
of three experiments). Nuclei, blue. Asterisks indicate p63-expressing cells. 
PCE, presumptive corneal epithelium; OSEpi, ocular surface epithelium; 
CE, corneal epithelium; CS, corneal stroma; LV, lens vesicle; LE, lens; 

AC, anterior chamber; EL, eyelid; OV, optic vesicle. Scale bar, 50 |1m. 
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Extended Data Figure 3 | The effect of BMP/TGF@ inhibitors on the 
development of ocular surface epithelium in the SEAM. a, Microscopic 
observation of the SEAM pre-treated with Noggin, LDN-193189 (LDN) 
or SB-431542 (SB) for four days at the start of differentiation culture 
(control (—), Noggin and LDN data are representative of six independent 
experiments, while SB data are representative of four independent 
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experiments). Both BMP and TGF inhibitors resulted in the abolishment 
of zone 3. Scale bar, 200 um. b, The effects of BMP or TGF inhibitors 

on the expression of ocular ectoderm-related genes at five to six weeks of 
differentiation culture (*P < 0.05, control; n= 13: Noggin, LDN and SB; 


n=8 independent experiments, respectively, Mann-Whitney test). Error 
bars are s.d. 
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Extended Data Figure 4 | Isolation of corneal epithelial cells from 

the SEAM. a, Schematic for the induction of ocular surface epithelium 
from human iPS cells. The ocular surface ectoderm expressing PAX6 

and p63 in zone 3 of the SEAM further differentiated into functional 
ocular surface epithelial stem cells that expressed K14. Among the ocular 
surface epithelial lineage cells, corneal epithelial progenitor cells were 
isolated by FACS as SSEA-4*/ITGB4* cells. Conjunctival epithelial cells 
(PAX6*/K13+/K12~) were obtained as SSEA-4~ cells. NE, neuroectoderm; 
NR, neuroretina; NC, neural crest; OSE, ocular surface ectoderm; SE, 
surface ectoderm. b, Immunostaining for SSEA-4 (green) and K12 (red) 
in stratified epithelial tissues, including human ocular surface epithelium 
(cornea, limbus and conjunctiva), epidermis and oral mucosa. 
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A magnified view of the limbus (highlighted box) indicated that SSEA-4 
was expressed in all layers of the limbal epithelium, including the basal 
layer, which had no K12 (asterisks). No other stratified epithelium expressed 
SSEA-4 and K12. Nuclei, blue. Scale bars, 501m. (Data for corneal, 

limbal and conjunctival tissue; n = 3; epidermis and oral mucosa; n= 1). 
c, Immunostaining for PAX6 (green) in colonies derived from P3 and 

P2 cells (representative of five independent cell-sorting experiments). 
Nuclei, blue. Scale bar, 100 1m. d, The expression of corneal epithelial- 
specific genes and non-corneal epithelial genes in the sorted P3 cells (that 
is, human iPS cell-derived corneal epithelial cells, iCECs) and P2 cells. 
*P <0.05 (n=7 independent cell sorting experiments, Mann-Whitney 
test). Error bars are s.d. 
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Extended Data Figure 5 | Characterization of SEAM-derived ocular HOXB4 (red) expression in the colonies of P2 cells (n = 1). Nuclei, blue. 


surface epithelial cells. a, Triple colour immunostaining for PAX6 Scale bar, 100m. d, Goblet-cell-like differentiation in the SEAM-derived 
(green), K12 (orange) and K13 (magenta) in the epithelial colonies epithelium after long-term culture (more than 12 weeks of differentiation) 
from sorted P3 cells (that is, SEAM-derived human iCECs) and P2 cells without FACS in CEM (n= 1). Goblet-cell-like morphology was observed 
(representative of five independent cell-sorting experiments). Nuclei, blue. _in presumptive P2 cell regions (left panel). These cells were PAS-positive 


Scale bars, 100 1m. b, HOX gene expression in the sorted P3 cells, P2 cells, and expressed the goblet cell markers MUCS5AC (green) and K7 (red) in 
human epidermal keratinocytes (EKs) and human corneal limbal epithelial the superficial region and PAX6 in the basal region. Nuclei, blue. Scale 
cells (CECs). (P3 and P2 cells; 7 = 7: human EKs and human CECs; n=5 bars, 501m. 

independent experiments). Error bars are s.d. c, The PAX6 (green) and 
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Extended Data Figure 6 | Characterization of the SEAM-derived corneal 
epithelium. a, Immunostaining for K19 and CX43 (green) in the stratified 
SEAM-derived human iCECs (representative of n = 3 independent 
experiments). Magnified view of the dotted area is shown in the lower 
panel. Nuclei, red. Scale bars, 501m. b, Scanning electron microscopy of 
the apical surface of the stratified human iCECs (representative of two 
human iCEC sheets) and human CECs (n = 1). Scale bars, 101m (upper 
panels) and 1 jm (lower panels). c, Results of a hierarchical cluster analysis 
based on the global gene expression as examined by microarrays. Data are 
shown for human iPS cells (n =3 technical replicates), human iPS cell- 
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cells after six weeks of differentiation, n =3 technical replicates), human 
iCECs (n = 4 independent experiments), human oral keratinocytes (OKs, 
n= 3 technical replicates), human dermal fibroblasts (DFs, n = 3 technical 
replicates) and human corneal epithelial cells obtained from the limbus 
(CECs, n = 3 independent experiments). A total of 25,262 significantly 
changed genes (fold change >2.0, false discovery rate <0.05) were 
analysed. d, SEAM-derived human iCECs at passage (P) 1, 3, 9 and 15 
during serial passages (representative of three independent experiments). 
Scale bar, 200 1m. e, The G-band karyotype of human iCECs at P= 1, 6 
and 12 (n= 1, respectively). 


derived ocular surface ectoderm (OSE; that is, human iPS cell-derived 
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Extended Data Figure 7 | Induction of corneal epithelial cells from 
different human iPS cell clones. a, SEAM formation patterns of five 
different human iPS cell clones (201B7 and 1383D2; representative of 
three independent experiments: 253G1, 454E2 and 1231A3;n=1). 
PBMC, peripheral blood mononuclear cell; TE, transcription factor. Scale 
bar, 200 um. b, Stratified human iCECs from 454E2, 253G1, 1231A3 

and 1383D2 clones in vitro (454E2, 253G1 and 1231A3, n= 1; 1383D2, 
representative of three independent experiments). Phase-contrast 


H&E staining 


H&E staining 
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H&E staining 


Ratio of P. 


microscopy (upper panel), haematoxylin and eosin (H&E) staining 
(middle panel) and immunostaining for K12 (lower panel, green) are 
shown. Nuclei, red. Scale bars, 200 j1m (phase-contrast), 501m (H&E 
staining and immunostaining). c, Efficiency of corneal epithelial 
differentiation of the various human iPS cell clones (201B7, n= 23; 253G1, 
n=5;454E2, n=11; 1231A3, n=9 and 1383D2, n=6 independent 
experiments). Error bars are s.d. 
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Extended Data Figure 8 | Transplantation of the human iCEC sheet to 
repair the ocular surface. a, Phase-contrast microscopy of SEAM-derived 
human iCECs cultivated on a temperature-responsive dish at 1, 5 and 

14 days (representative of three independent experiments). A harvested 
human iCEC sheet is shown in the lower right panel. Scale bar, 200 pm 
(phase-contrast), 5mm (macro photo). b, Immunostaining for PAX6 
(green), K12 (orange) and K14 (magenta) in the human iCEC sheets after 
21 days in culture (left panels, representative of three human iCEC sheets). 
The right panel shows an image used for a colony-forming assay (CFA) 
for the human iCEC sheets (15,000 cells per well, representative of eight 
independent experiments). The colony-forming efficiency (CFE) was 

1.04 + 0.43% (s.d., n = 8 independent experiments). Nuclei, blue. Scale 
bars, 100 1m (immunostaining) and 5mm (CFA). c, Treatment of rabbit 


CFE: 1.04+0.43% 


ocular surface with a surgical swab soaked in 99% ethanol for 30-60 s. 
d, e, Elimination of corneal epithelial stem cells by the surgical removal 

of corneal and limbal epithelial tissue (that is, lamellar keratectomy). 

f, The ocular surface was invaded by conjunctival tissue and vessels at 
postoperative day 28. g, Removal of the conjunctival tissue that covered 
the ocular surface. h, Widespread fluorescein staining of the ocular surface 
after surgical removal of epithelial tissue. i, The harvest of a human iCEC 
sheet from a temperature-responsive dish after lowering the temperature. 
j, Transplantation of the human iCEC sheet onto the rabbit cornea with 

a surgically induced corneal epithelial stem-cell deficiency treated 

as described in c-h. k, H&E staining for human iCEC-sheet-transplanted 
and control corneas on postoperative day 14 (n =6 animal transplantation 
experiments). Scale bar, 100,1m. 
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Therapeutic efficacy of the small molecule GS-5734 
against Ebola virus in rhesus monkeys 


Travis K. Warren!*, Robert Jordan*, Michael K. Lo*, Adrian S. Ray*, Richard L. Mackman’, Veronica Soloveva!, Dustin Siegel’, 
Michel Perron®, Roy Bannister?, Hon C. Hui’, Nate Larson’, Robert Strickley*, Jay Wells!, Kelly S. Stuthman!, 

Sean A. Van Tongeren!, Nicole L. Garza', Ginger Donnelly!, Amy C. Shurtleff!, Cary J. Retterer!, Dima Gharaibeh!, 

Rouzbeh Zamani!, Tara Kenny!, Brett P. Eaton!, Elizabeth Grimes!, Lisa S. Welch!+, Laura Gomba!”, Catherine L. Wilhelmsen!, 
Donald K. Nichols!, Jonathan E. Nuss!?, Elyse R. Nagle!, Jeffrey R. Kugelman!, Gustavo Palacios!, Edward Doerffler’, 
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Pamela Wong®, Molly R. Braun®, Mike Flint*, Laura K. McMullan’, Shan-Shan Chen’, Rachel Fearns®, Swami Swaminathan’, 


Douglas L. Mayers't, Christina F. Spiropoulou’, William A. Lee®, Stuart T. Nichol*, Tomas Cihlar? & Sina Bavari!> 


The most recent Ebola virus outbreak in West Africa, which was 
unprecedented in the number of cases and fatalities, geographic 
distribution, and number of nations affected, highlights the 
need for safe, effective, and readily available antiviral agents for 
treatment and prevention of acute Ebola virus (EBOV) disease 
(EVD) or sequelae’. No antiviral therapeutics have yet received 
regulatory approval or demonstrated clinical efficacy. Here 
we report the discovery of a novel small molecule GS-5734, a 
monophosphoramidate prodrug of an adenosine analogue, with 
antiviral activity against EBOV. GS-5734 exhibits antiviral activity 
against multiple variants of EBOV and other filoviruses in cell- 
based assays. The pharmacologically active nucleoside triphosphate 
(NTP) is efficiently formed in multiple human cell types incubated 
with GS-5734 in vitro, and the NTP acts as an alternative substrate 
and RNA-chain terminator in primer-extension assays using a 
surrogate respiratory syncytial virus RNA polymerase. Intravenous 
administration of GS-5734 to nonhuman primates resulted in 
persistent NTP levels in peripheral blood mononuclear cells (half- 
life, 14h) and distribution to sanctuary sites for viral replication 
including testes, eyes, and brain. In a rhesus monkey model of EVD, 
once-daily intravenous administration of 10 mgkg~! GS-5734 for 
12 days resulted in profound suppression of EBOV replication and 
protected 100% of EBOV-infected animals against lethal disease, 
ameliorating clinical disease signs and pathophysiological markers, 
even when treatments were initiated three days after virus exposure 
when systemic viral RNA was detected in two out of six treated 
animals. These results show the first substantive post-exposure 
protection by a small-molecule antiviral compound against 
EBOV in nonhuman primates. The broad-spectrum antiviral 
activity of GS-5734 in vitro against other pathogenic RNA viruses, 
including filoviruses, arenaviruses, and coronaviruses, suggests the 
potential for wider medical use. GS-5734 is amenable to large-scale 
manufacturing, and clinical studies investigating the drug safety 
and pharmacokinetics are ongoing. 

The 2013-2016 outbreak of EVD in West Africa was the largest 
and most complex EBOV outbreak in the recorded history of the dis- 
ease, with >28,000 EVD cases and >11,000 reported deaths!. Medical 
infrastructures in Guinea, Sierra Leone, and Liberia were seriously 
impacted by a loss of >500 healthcare workers’. Additionally, EVD- 
related sequelae (joint and muscle pain, as well as neurological, oph- 
thalmic, and other symptoms) together with viral persistence and 


+12 


recrudescence in individuals who survived the acute disease have 
been documented”>. 

EBOV is a single-stranded negative-sense non-segmented RNA 
virus from the Filoviridae family. In addition to EBOV, other related 
viruses, namely Marburg, Sudan, and Bundibugyo viruses, have caused 
outbreaks with high fatality rates®. Although the efficacy of various 
experimental small molecules and biologics have been assessed in EVD 
animal models and in multiple clinical trials during the West African 
outbreak’~!8, there are no therapeutics for which clinical efficacy and 
safety have been established for treatment of acute EVD or its seque- 
lae. The availability of broadly effective antiviral(s) with a favourable 
benefit/risk profile would address a serious unmet medical need for the 
treatment of EBOV infection. 

A 1/-cyano-substituted adenine C-nucleoside ribose analogue (Nuc) 
exhibits antiviral activity against a number of RNA viruses'®. The 
mechanism of action of Nuc requires intracellular anabolism to the 
active triphosphate metabolite (NTP), which is expected to interfere 
with the activity of viral RNA-dependent RNA-polymerases (RdRp). 
Structurally, the 1’-cyano group provides potency and selectivity 
towards viral RNA polymerases, but because of slow first phosphoryl- 
ation kinetics, modification of parent nucleosides with monophosphate 
promoieties has the potential to greatly enhance intracellular NTP 
concentrations”’. GS-5734, the single Sp isomer of the 2-ethylbutyl 
L-alaninate phosphoramidate prodrug (Supplementary Information), 
effectively bypasses the rate-limiting first phosphorylation step of the 
Nuc (Fig. 1a). In human monocyte-derived macrophages, incubation 
with GS-5734 caused rapid loading of cells with high levels of NTP 
that persist with a half-life (t)/2) of 24h following removal of GS-5734 
(Extended Data Fig. 1a), resulting in up to 30-fold higher levels com- 
pared to incubation with Nuc (Fig. 1b). In cell-based assays, GS-5734 
is active against a broad range of filoviruses including Marburg virus 
and several variants of EBOV (Fig. 1c). GS-5734 inhibits EBOV 
replication in multiple relevant human cell types including primary 
macrophages and human endothelial cells with half-maximum effective 
concentration (ECs) values of 0.06 to 0.14|1.M (Table 1). As expected, 
the parent Nuc was less active, with ECs values of 0.77 to >20 1M. 
Treatment with GS-5734 of liver Huh-7 cells infected with the EBOV 
Makona variant, isolated during the West African outbreak, resulted 
in profound dose-dependent reductions in viral RNA production 
and infectious virus yield (Extended Data Fig. 2). GS-5734 and Nuc 
inhibited replication of other human RNA viral pathogens including 
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Figure 1 | Metabolism and mechanism of antiviral activity of GS-5734. 
a, Chemical structures of GS-5734 and metabolic conversion to NTP. 

b, NTP formation in human monocyte-derived macrophages following 
72-h incubation with 1 1M GS-5734 (black) or Nuc (red); mean +s.d., 
from 3 donors. c, Antiviral activity of GS-5734 in HeLa cells against EBOV 
Makona (black symbols), EBOV Kikwit (open symbols), Marburg (red), 
Bundibugyo (orange), Sudan (blue) viruses; mean + s.d. from triplicates. 


respiratory syncytial virus (RSV), Junin virus, Lassa fever virus, and 
Middle East respiratory syndrome virus, but was inactive against alpha- 
viruses or retroviruses (Table 1). Previous studies have reported activity 
of Nuc against flaviviruses, parainfluenza virus type 3, and severe acute 
respiratory syndrome associated coronavirus, but little or no activity 
against West Nile, influenza A, or Coxsackie A viruses!?”!, The antiviral 
activity of GS-5734 was selective, as demonstrated by low cytotoxicity in a 
wide range of human primary cells and cell lines (Extended Data Table 1). 
Isolation and expression of EBOV RdRp has been elusive, but 
computational analysis of the catalytic palm subdomain demonstrated 
high sequence and structure homology with RSV RdRp” (Fig. 1d, 
Extended Data Fig. 3). Consistent with the proposed mechanism of 
action, NTP inhibited RSV RdRp-catalysed RNA synthesis (Fig. le) by 
incorporating into the nascent viral RNA transcript and causing its pre- 
mature termination (Fig. 1f). In contrast, NTP did not inhibit human 
RNA polymerases (Fig. le). These data suggest that GS-5734 selectively 
inhibits EBOV replication by targeting its RdRp and inhibiting viral 
RNA synthesis following efficient intracellular conversion to NTP. 
Rodent models were not suitable for GS-5734 in vivo efficacy eval- 
uations because high serum esterase activity, present in many rodent 
species, degrades the GS-5734 promoiety and adversely impacts its 
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d, Amino acid sequence homology of EBOV and RSV strain A2 RdRp 
active site. Asterisks indicate residues predicted to contact NTP. 

e, Inhibition of RSV RdRp (blue), but not human RNA Pol II (black) 

or mitochondrial RNA (red) polymerases by NTP; mean +s.d., n =3 
biological replicates. f, NTP-induced RNA chain termination in RSV RdRp 
primer-extension assay. For gel source data, see Supplementary Fig. 1. 


pharmacokinetic profile’®. Like humans, rhesus monkeys do not 
express high levels of serum esterase; rhesus lymphoid cells efficiently 
activated GS-5734 in vitro, although NTP levels were reduced relative 
to human cells (Extended Data Fig. 1b). In rhesus monkeys, intramus- 
cular inoculation with clinically derived wild-type EBOV produces a 
fulminant lethal disease with pathophysiological responses that closely 
resemble human EVD cases”*”°, and nonhuman primates (NHP) are 
considered the most relevant EVD models well-suited for evaluating 
the efficacy of antiviral interventions when trials in infected humans 
are not feasible. 

GS-5734 pharmacokinetics, metabolism, and distribution were 
examined in NHPs. Upon intravenous administration ofa 10 mgkg™! 
dose in rhesus monkeys, GS-5734 exhibited a short plasma half-life 
(t/2=0.39h) with fast systemic elimination followed by the sequential 
appearance of transient systemic levels of the key intracellular inter- 
mediate alanine metabolite and more persistent levels of Nuc (Fig. 2a). 
GS-5734 rapidly distributed into peripheral blood mononuclear cells 
(PBMCs), and efficient conversion to NTP was apparent within 2h 
of dose administration. In PBMCs, NTP represents the predominant 
metabolite and was persistent with a f,/2 of 14h and levels required 
for >50% virus inhibition for 24h (Fig. 2a, Extended Data Fig. 1c). 
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Table 1 | Antiviral activity of GS-5734 and Nuc 


Antiviral activity; ECs0/ECao (uM) 


GS-5734 Nuc 
EBOV 
Primary macrophages* 0.086/0.18 >20/>20 
HeLa cellst 0.14/0.41 >20/>20 
HFF-1* 0.13/0.26 >20/>20 
HMVEC-TERT cellst 0.06/0.22 0.77/3.12 
Huh-7 cellst 0.07/0.22 1.49/6.04 
Other human RNA viruses 
RSV¢ 0.019/0.051 0.75/2.84 
JUNV§ 0.47/1.33 
LASV§ 1.48/2.80 
MERS§ 0.34/4.24 
CHIV§ >20/>20 
VEEV§ >20/>20 
HIV-1§ >20/>20 


Half-maximum cytotoxic concentration (CCso) values for all compounds in primary human cells 
and human cell lines were greater than the highest concentration tested (>20,1M). 

*Mean values from duplicated titrations conducted in differentiated macrophages or HFF-1 
cells in a single experiment (n= 1). Cells were infected with EBOV (Makona) for antiviral activity 
determination. 
Mean values from quadruplicate (HMVEC-TERT) or duplicate (Huh-7) titrations generated from 
two experiments (n= 2) or from multiple experiments (n = 6) for HeLa cells. Cells were infected 
with a replication-competent reporter virus (EBOV-GFP) or wild-type EBOV strain Zaire (HeLa) for 
antiviral activity determination. 
Mean values from two (GS-5734) or four (Nuc) independent experiments with each drug dilution 
ested in triplicate against the respiratory syncytial virus (RSV). 
§Mean values from duplicate titrations with each drug concentration tested in quadruplicate from 
a single experiment (n= 1). 

JUNY, Junin virus; LASV, Lassa fever virus; MERS, Middle East respiratory syndrome coronavirus; 
CHIV, Chikungunya virus, VEEV, Venezuelan equine encephalitis virus; HIV-1, human immunode- 
ficiency virus type 1. 


In cynomolgus monkeys, intravenous administration of a 10 mgkg~! 
dose of ['4C]GS-5734 demonstrated that the drug-derived material 
distributed to testes, epididymis, eyes, and brain within 4h of admin- 
istration (Fig. 2b). Levels in the brain at 4h were low relative to other 
tissues, but remained detectable above the drug plasma levels 168h 
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Figure 2 | GS-5734 pharmacokinetics and post-exposure protection against 
EBOV in rhesus monkeys. a, Pharmacokinetics following intravenous 
administration of 10mgkg~' GS-5734 dose in healthy rhesus macaques 
(mean +s.d., n= 3). Plasma GS-5734 (black), alanine metabolite (red), and 
Nuc (blue); NTP in PBMCs (green). b, Tissue distribution of ['4C]GS-5734 
and metabolites at 4h (blue) and 168h (red) following intravenous 10 mgkg~’ 
GS-5734 dose in healthy cynomolgus macaques (mean +s.d., n= 3). ¢, 
Experimental design for GS-5734 efficacy evaluations in rhesus monkeys. No 
T,, no treatment. d, Kaplan-Meier survival curves. *P < 0.05 for treatment 
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after dose administration. Taken together, the pharmacokinetic anal- 
ysis indicates that once-daily dosing of GS-5734 provides sustained 
intracellular NTP levels and efficiently delivers drug metabolites to 
sanctuary sites where virus may persist. 

To evaluate the in vivo efficacy of GS-5734, we conducted a sequen- 
tial two-part adaptive design study in EBOV-infected rhesus monkeys 
(Fig. 2c). In part 1, animals intramuscularly inoculated with EBOV 
were administered a 12-day treatment of vehicle (n= 3) or 3mgkg™! 
GS-5734 beginning on day 0 (d0; 30-90 min following virus challenge) 
or day 2 (d2) (n=6 per treatment group). Regardless of the time of 
initiation, GS-5734 treatment conferred an antiviral effect by reducing 
systemic viraemia relative to vehicle and an improved survival of 33% (2 
out of 6) in the 3mgkg~’ dO group and 66% (4 out of 6) in the 3mgkg~ 
d2 group (Fig. 2d, e, Extended Data Fig. 4, Extended Data Tables 2, 3); 
however, mortalities observed in both treatment groups suggested that 
drug exposure at 3mgkg~' was suboptimal. In part 2 of the efficacy 
study, GS-5734 was administered once at a loading dose of 10 mgkg™! 
followed by once-daily 3 mg kg~! doses beginning either 2 days 
(10/3 mg kg™! d2) or 3 days (10/3 mgkg~! d3) after virus exposure, 
or 10mgkg™! doses were administered beginning 3 days after virus 
exposure (10mgkg~‘ d3; n=6 per group). All 12 animals in which 
GS-5734 treatments were initiated 3 days after virus exposure survived 
to the end of the in-life phase (Fig. 2d). However, the antiviral effects 
were consistently greater in animals administered repeated 10mg kg™! 
GS-5734 doses (Fig. 2e, f, Extended Data Fig. 4, Extended Data Tables 2, 3). 
On day 4, plasma viral RNA was significantly decreased (P< 0.05), with 
geometric means reduced by >1.7 logy in all GS-5734-treated groups 
compared with combined vehicle-treated groups (Fig. 2e, f, Extended 
Data Table 3), and on days 5 and 7, when the geometric mean viral 
RNA concentration of the vehicle group exceeded 10° copies ml“, 
viral RNA was detected at levels less than the lower limit of quantita- 
tion (8 x 10* RNA copies ml~') in 4 of 6 animals in the 10 mgkg~! d3 
group. Deep sequencing analysis of the EBOV RdRp (L) gene from all 
plasma samples positive for viral RNA showed no evidence of genotypic 
changes potentially associated with the emergence of GS-5734-resistant 
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versus vehicle groups assessed by log-rank analysis using Dunnett-Hsu 
procedure to adjust for multiple comparisons. e, Group geometric mean 

of plasma viral RNA concentrations; LLOQ, lower limit of quantitation; 
LOD, limit of detection. f, Individual plasma viral RNA in vehicle (blue) or 
10mgkg_! GS-5734 (black) groups. g, Group average clinical disease score. 
d, e, g, Black (open symbols), vehicle; red, 3mgkg~! dO; green, 3mgkg™! 

d2; blue, 10/3 mgkg™! d2; orange, 10/3 mgkg! d3; black (closed symbols), 
10mgkg' d3; n=6 animals per group. Error bars omitted for clarity (e, g); x 
axes truncated to emphasize acute disease phase (f, g). 
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Figure 3 | Amelioration of EVD clinical pathology by GS-5734 in 
rhesus monkeys. a—f, Group mean (n =6 per group) values of platelets 
(a), thrombin time (b), aspartate aminotransferase (AST, c), alanine 
aminotransferase (ALT, d), blood urea nitrogen (BUN, e), and creatinine 
(f). Black (open symbols), vehicle; red, 3 mgkg~! dO; green, 3mgkg~! d2; 
blue, 10/3 mgkg~! d2; orange, 10/3 mgkg ! d3; black (closed symbols), 
10mgkg' d3. Error bars omitted for clarity; x axes truncated at day 15. 
*P <0.05 for comparison of mean change from day 0 of vehicle and 
10mgkg~' d3 groups at day 7 using Wilcoxon rank-sum test without 
adjustment for multiple comparisons. 


EBOV variants (Extended Data Table 5). The 10mgkg~’ d3 GS-5734 
regimen was associated with amelioration of EVD-related clinical 
disease signs (Fig. 2g, Extended Data Fig. 4) and markers of coagulopa- 
thy and end-organ pathophysiology (Fig. 3a-f, Extended Data Table 4, 
Extended Data Fig. 5). Although greater survival was observed in the 
10/3 mgkg™! d3 group than the 10/3 mgkg~! d2 group, survival and 
viral RNA load were not statistically distinguishable (Fig. 2d, e) and 
probably represent natural endpoint variation associated with subop- 
timal therapeutic effect. 

In summary, GS-5734 is a potent and selective inhibitor of EBOV 
in multiple relevant permissive cell types. In healthy NHPs, intrave- 
nous administration of GS-5734 resulted in rapid accumulation and 
persistence of intracellular NTP. In an NHP model of fatal EVD, pro- 
nounced antiviral effects, amelioration of EVD signs, and significant 
survival benefit was achieved despite treatment initiation on day 3, 
a time when systemic viral RNA was detectable. These results represent 
the first case of substantive post-exposure protection against EVD bya 
small-molecule antiviral compound in NHPs. Intravenous GS-5734 is 
currently being evaluated in multiple-dose studies in healthy human 
volunteers to assess clinical safety and pharmacokinetics, and help 
determine whether GS-5734 may provide therapeutic benefit in acute 
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or recrudescent cases of EVD, or in survivors with prolonged virus 
shedding and/or chronic clinical sequelae. The broad-spectrum anti- 
viral activity of GS-5734 and its amenability to large-scale production 
warrants further assessment of its therapeutic potential against other 
human viral pathogens for which no treatment is available. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size 
for biochemical or cell-based assays, or for pharmacokinetic studies. Investigators 
were not blinded to outcome assessment during these investigations. For GS-5734 
efficacy assessments in nonhuman primates, statistical power analysis was used to 
predetermine sample size, and subjects were randomly assigned to experimental 
group, stratified by sex and balanced by body weight. Study personnel responsible 
for assessing animal health (including euthanasia assessment) and administer- 
ing treatments were experimentally blinded to group assignment of animals and 
outcome. 
Small molecules. GS-5734, Nuc, and NTP were synthesized at Gilead Sciences, 
Inc., and chemical identity and sample purity were established using NMR, HRMS, 
and HPLC analysis (Supplementary Information). The radiolabelled analogue 
(!4C]GS-5734 (specific activity, 58.0 mCi mmol™!) was obtained from Moravek 
Biochemicals (Brea, California) and was prepared in a similar manner described 
for GS-5734 using ['4C]trimethylsilylcyanide (Supplementary Information). Small 
molecule X-ray crystallographic coordinates and structure factor files have been 
deposited in the Cambridge Structural Database (http://www.ccdc.cam.ac.uk/) and 
accession numbers are supplied in the Supplementary Information. 
Viruses. RSV A2 was purchased from Advanced Biotechnologies, Inc. EROV 
(Kikwit and Makona variants), Sudan virus (SUDV, Gulu), Marburg virus 
(MARV, Ci67), Junin virus (JUNV, Romero), Lassa virus (LASV, Josiah), Middle 
East respiratory syndrome virus (MERS, Jordan N3), Chikungunya virus (CHIV, 
AF 15561), and Venezuelan equine encephalitis virus (VEEV, SH3) were all prepared 
and characterized at the United States Army Medical Research Institute for infec- 
tious diseases (USAMRIID). EBOV containing a GFP reporter gene (EBOV-GFP), 
EBOV Makona (Liberia, 2014), and MARV containing a GFP reporter gene 
(MARV-GEFP) were prepared and characterized at the Centers for Disease Control 
and Prevention?” 
Cells. HEp-2 (CCL-23), PC-3 (CCL-1435), HeLa (CCL-2), U2OS (HTB-96), Vero 
(CCL-81), HFF-1 (SCRC-1041), and HepG2 (HB-8065) cell lines were purchased 
from the American Type Culture Collection. Cell lines were not authenticated 
and were not tested for mycoplasma as part of routine use in assays. HEp-2 cells 
were cultured in Eagle’s Minimum Essential Media (MEM) with GlutaMAX 
supplemented with 10% fetal bovine serum (FBS) and 100 U ml"! penicillin and 
streptomycin. PC-3 cells were cultured in Kaighn’s F12 media supplemented with 
10% FBS and 100U ml"! penicillin and streptomycin. HeLa, U2OS, and Vero 
cells were cultured in MEM supplemented with 10% FBS, 1% L-glutamine, 10 mM 
HEPES, 1% non-essential amino acids, and 1% penicillin/streptomycin. HFF-1 
cells were cultured in MEM supplemented with 10% FBS and 0.5mM sodium 
pyruvate. HepG2 cells were cultured in Dulbecco’s Modified Eagle Medium 
(DMEM) with GlutaMAX supplemented with 10% FBS, 100U ml"! penicillin 
and streptomycin, and 0.1 mM non-essential amino acids. The MT-4 cell line was 
obtained from the NIH AIDS Research and Reference Reagent Program and cul- 
tured in RPMI-1640 medium supplemented with 10% FBS, 100 Uml! penicillin 
and streptomycin, and 2mM t-glutamine. The Huh-7 cell line was obtained from 
C. M. Rice (Rockefeller University) and cultured in DMEM supplemented with 
10% FBS, 100 Ul" penicillin and streptomycin, and non-essential amino acids. 
Primary human hepatocytes were purchased from Invitrogen and cultured 
in William’s Medium E medium containing cell maintenance supplement. 
Donor profiles were limited to 18- to 65-year-old nonsmokers with limited 
alcohol consumption. Upon delivery, the cells were allowed to recover for 24h 
in complete medium with supplement provided by the vendor at 37°C. Human 
PBMCs were isolated from human buffy coats obtained from healthy volunteers 
(Stanford Medical School Blood Center, Palo Alto, California) and maintained 
in RPMI-1640 with GlutaMAX supplemented with 10% FBS, 100 Uml! pen- 
icillin and streptomycin. Rhesus fresh whole blood was obtained from Valley 
Biosystems. PBMCs were isolated from whole blood by Ficoll-Hypaque den- 
sity gradient centrifugation. Briefly, blood was overlaid on 15 ml Ficoll-Paque 
(GE Healthcare Bio-Sciences AB), and centrifuged at 500g for 20 min. The top 
layer containing platelets and plasma was removed, and the middle layer con- 
taining PBMCs was transferred to a fresh tube, diluted with Tris buffered saline 
up to 50 ml, and centrifuged at 500g for 5 min. The supernatant was removed 
and the cell pellet was resuspended in 5 ml red blood cell lysis buffer (155 mM 
ammonium chloride, 10 mM potassium bicarbonate, 0.1 mM EDTA, pH 7.5). To 
generate stimulated PBMCs, freshly isolated quiescent PBMCs were seeded into a 
T-150 (150 cm?) tissue culture flask containing fresh medium supplemented with 
10 U ml ! of recombinant human interleukin-2 (IL-2) and 11g ml“! phytohae- 
magglutinin-P at a density of 2 x 10° cells ml“! and incubated for 72h at 37°C. 
Human macrophage cultures were isolated from PBMCs that were purified by 
Ficoll gradient centrifugation from 50 ml of blood from healthy human volun- 
teers. PBMCs were cultured for 7 to 8 days in in RPMI cell culture media supple- 
mented with 10% FBS, 5 to 50ng ml! granulocyte-macrophage colony-stimulating 


factor and 50,1M 3-mercaptoethanol to induce macrophage differentiation. 
The cryopreserved human primary renal proximal tubule epithelial cells were 
obtained from LifeLine Cell Technology and isolated from the tissue of human 
kidney. The cells were cultured at 90% confluency with RenaLife complete medium 
ina T-75 flask for 3 to 4 days before seeding into 96-well assay plates. Immortalized 
human microvascular endothelial cells (HMVEC-TERT) were obtained from R. 
Shao at the Pioneer Valley Life Sciences Institute**, HMVEC-TERT cells were 
cultured in endothelial basal media supplemented with 10% FBS, 51g of epithelial 
growth factor, 0.5 mg hydrocortisone, and gentamycin/amphotericin-B. 
Enzymes. RNA POLII was purchased as part of the HeLaScribe Nuclear Extract 
in vitro Transcription System kit from Promega. The recombinant human POLRMT 
and transcription factors mitochondrial transcription factors A (mtTFA or TFAM) 
and B2 (mtTFB2 or TFB2M) were purchased from Enzymax. RSV ribonucleopro- 
tein (RNP) complexes were prepared according to a method modified from ref. 29. 
Intracellular metabolism studies. The intracellular metabolism of GS-5734 was 
assessed in different cell types (HMVEC and HeLa cell lines, and primary human 
and rhesus PBMCs, monocytes and monocyte-derived macrophages) following 2-h 
pulse or 72-h continuous incubations with 10|1M GS-5734. For comparison, intra- 
cellular metabolism during a 72-h incubation with 10|1M of Nuc was completed in 
human monocyte-derived macrophages. For pulse incubations, monocyte-derived 
macrophages isolated from rhesus monkeys or humans were incubated for 2h 
in compound-containing media followed by removal, washing with 37 °C drug- 
free media, and incubated for an additional 22 h in media which did not contain 
GS-5734. Human monocyte-derived macrophages, HeLa and HMVEC were grown 
to confluence (approximately 0.5, 0.2, and 1.2 x 10° cells per well, respectively) 
in 50011 of media in 12-well tissue culture plates. Monocyte and PBMCs were 
incubated in suspension (approximately 1 x 10° cells ml~!) in 1 ml of media in 
micro centrifuge tubes. 

For adherent cells (HMVEC, HeLa, and monocyte-derived macrophages), 
media was removed at select time points from duplicate wells, cells washed twice 
with 2 ml of ice-cold 0.9% normal saline. For non-adherent cells (monocytes 
and PBMCs), duplicate incubations were centrifuged at 2,500g for 30s to remove 
media. The cell pellets were re-suspended with 500 1 cell culture media (RPMI 
with 10% FBS) and layered on top of a 500 il oil layer (Nyosil M25; Nye Lubricants) 
in a microcentrifuge tube. Samples were then centrifuged at room temperature at 
13,000 r.p.m. for 45s. The media layer was removed and the oil layer was washed 
twice with 50011 water. The oil layer was then carefully removed using a Pasteur 
pipet attached to vacuum. A volume of 0.5 ml of 70% methanol containing 100nM 
of the analytical internal standard 2-chloro-adenosine-5’-triphosphate (Sigma- 
Aldrich) was added to isolated cells. Samples were stored overnight at —20°C to 
facilitate extraction, centrifuged at 15,000g for 15 min and then supernatant was 
transferred to clean tubes for drying in a MiVac Duo concentrator (Genevac). 
Dried samples were then reconstituted in mobile phase A containing 3 mM ammo- 
nium formate (pH 5.0) with 10mM dimethylhexylamine (DMH) in water for anal- 
ysis by liquid chromatography coupled to triple quadrupole mass spectrometry 
(LC-MS/MS). 

LC-MS/MS was performed using low-flow ion-pairing chromatography, sim- 
ilar to methods described previously*”. Briefly, analytes were separated using a 
50 x 2mm x 2.541m Luna C18(2) HST column (Phenomenex) connected to 
a LC-20ADXR (Shimadzu) ternary pump system and HTS PAL autosampler 
(LEAP Technologies). A multi-stage linear gradient from 10% to 50% acetonitrile 
in a mobile phase containing 3 mM ammonium formate (pH 5.0) with 10 mM 
dimethylhexylamine over 8 min at a flow rate of 15011 min”! was used to separate 
analytes. Detection was performed on an API 4000 (Applied Biosystems) MS/MS 
operating in positive ion and multiple reaction monitoring modes. Intracellular 
metabolites alanine metabolite, Nuc, nucleoside monophosphate, nucleoside 
diphosphate, and nucleoside triphosphate were quantified using 7-point standard 
curves ranging from 0.274 to 200 pmol (approximately 0.5 to 400|1M) prepared in 
cell extract from untreated cells. Levels of adenosine nucleotides were also quan- 
tified to assure dephosphorylation had not taken place during sample collection 
and preparation. In order to calculate intracellular concentration of metabolites, 
the total number of cells per sample were counted using a Countess automated 
cell counter (Invitrogen). 

EBOV Huh-7 and HMVEC antiviral assay. Antiviral assays were conducted 
in biosafety level 4 containment (BSL-4) at the Centers for Disease Control and 
Prevention. EBOV antiviral assays were conducted in primary HMVEC-TERT 
and in Huh-7 cells. Huh-7 cells were not authenticated and were not tested for 
mycoplasma. Ten concentrations of compound were diluted in fourfold serial 
dilution increments in media, and 100 11 per well of each dilution was transferred 
in duplicate (Huh-7) or quadruplicate (HMVEC-TERT) onto 96-well assay plates 
containing cell monolayers. The plates were transferred to BSL-4 containment, 
and the appropriate dilution of virus stock was added to test plates containing 
cells and serially diluted compounds. Each plate included four wells of infected 
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untreated cells and four wells of uninfected cells that served as 0% and 100% virus 
inhibition controls, respectively. After the infection, assay plates were incubated 
for 3 days (Huh-7) or 5 days (HMVEC-TERT) in a tissue culture incubator. Virus 
replication was measured by direct fluorescence using a Biotek HTSynergy plate 
reader. For virus yield assays, Huh-7 cells were infected with wild-type EBOV for 
1h at 0.1 plaque-forming units (PFU) per cell. The virus inoculum was removed 
and replaced with 100,11 per well of media containing the appropriate dilution of 
compound. At 3 days post-infection, supernatants were collected, and the amount 
of virus was quantified by endpoint dilution assay. The endpoint dilution assay was 
conducted by preparing serial dilutions of the assay media and adding these dilu- 
tions to fresh Vero cell monolayers in 96-well plates to determine the tissue culture 
infectious dose that caused 50% cytopathic effects (TCIDs9). To measure levels of 
viral RNA from infected cells, total RNA was extracted using the MagMAX-96 
Total RNA Isolation Kit and quantified using a quantitative reverse transcription 
polymerase chain reaction (qRT-PCR) assay with primers and probes specific for 
the EBOV nucleoprotein gene. 

EBOV assay in HeLa and HFF-1 cells. Antiviral assays were conducted in BSL-4 
at USAMRIID. HeLa or HFF-1 cells were seeded at 2,000 cells per well in 384-well 
plates. Ten serial dilutions of compound in triplicate were added directly to the cell 
cultures using the HP D300 digital dispenser (Hewlett Packard) in twofold dilution 
increments starting at 10|.M at 2h before infection. The DMSO concentration in 
each well was normalized to 1% using an HP D300 digital dispenser. The assay 
plates were transferred to the BSL-4 suite and infected with EBOV Kikwit at a mul- 
tiplicity of infection of 0.5 PFU per cell for HeLa cells and with EBOV Makona at a 
multiplicity of infection of 5 PFU per cell for HFF-1 cells. The assay plates were incu- 
bated in a tissue culture incubator for 48 h. Infection was terminated by fixing the 
samples in 10% formalin solution for an additional 48 h before immune-staining, 
as described in Supplementary Table 1. 

EBOV human macrophage infection assay. Antiviral assays were conducted in 
BSL-4 at USAMRIID. Primary human macrophage cells were seeded in a 96-well 
plate at 40,000 cells per well. Eight to ten serial dilutions of compound in triplicate 
were added directly to the cell cultures using an HP D300 digital dispenser in 
threefold dilution increments 2h before infection. The concentration of DMSO 
was normalized to 1% in all wells. The plates were transferred into the BSL-4 suite, 
and the cells were infected with 1 PFU per cell of EBOV in 100,11 of media and 
incubated for 1 h. The inoculum was removed, and the media was replaced with 
fresh media containing diluted compounds. At 48h post-infection, virus replica- 
tion was quantified by immuno-staining as described in Supplementary Table 1. 
RSV A2 antiviral assay. For antiviral tests, compounds were threefold serially 
diluted in source plates from which 100 nl of diluted compound was transferred to 
a 384-well cell culture plate using an Echo acoustic transfer apparatus. HEp-2 cells 
were added at a density of 5 x 10° cells per ml, then infected by adding RSV A2 ata 
titer of 1 x 10*° tissue culture infectious doses (TCIDs9) per ml. Immediately fol- 
lowing virus addition, 20 1] of the virus and cells mixture was added to the 384-well 
cell culture plates using a |1Flow liquid dispenser and cultured for 4 days at 37°C. 
After incubation, the cells were allowed to equilibrate to 25°C for 30 min. The 
RSV-induced cytopathic effect was determined by adding 201] of CellTiter-Glo 
Viability Reagent. After a 10-min incubation at 25°C, cell viability was determined 
by measuring luminescence using an Envision plate reader. 

High content imaging assay detecting viral infection. Antiviral assays were 
conducted in 384-or 96-well plates in BSL-4 at USAMRIID using a high-content 
imaging system to quantify virus antigen production as a measure of virus infec- 
tion. A ‘no virus’ control and a ‘1% DMSO’ control were included to determine the 
0% and 100% virus infection, respectively. The primary and secondary antibodies 
and dyes used for nuclear and cytoplasmic staining are listed in Supplementary 
Table 1. The primary antibody specific for a particular viral protein was diluted 
1,000-fold in blocking buffer (1 x PBS with 3% BSA) and added to each well of 
the assay plate. The assay plates were incubated for 60 min at room tempera- 
ture. The primary antibody was removed, and the cells were washed three times 
with 1 x PBS. The secondary detection antibody was an anti-mouse (or rabbit) 
IgG conjugated with Dylight488 (Thermo Fisher Scientific, catalogue number 
405310). The secondary antibody was diluted 1,000-fold in blocking buffer and 
was added to each well in the assay plate. Assay plates were incubated for 60 min at 
room temperature. Nuclei were stained using Draq5 (Biostatus) or 33342 Hoechst 
(ThermoFisher Scientific) for Vero and HFF-1 cell lines. Both dyes were diluted in 
1x PBS. The cytoplasm of HFF-1 (EBOV assay) and Vero E6 (MERS assay) cells 
were counter-stained with CellMask Deep Red (Thermo Fisher Scientific). Cell 
images were acquired using a Perkin Elmer Opera confocal plate reader (Perkin 
Elmer) using a x 10 air objective to collect five images per well. Virus-specific anti- 
gen was quantified by measuring fluorescence emission at a 488 nm wavelength 
and the stained nuclei were quantified by measuring fluorescence emission at a 
640 nm wavelength. Acquired images were analysed using Harmony and Acapella 
PE software. The Draq5 signal was used to generate a nuclei mask to define each 
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nuclei in the image for quantification of cell number. The CellMask Deep Red dye 
was used to demarcate the Vero and HFF-1 cell borders for cell-number quantita- 
tion. The viral-antigen signal was compartmentalized within the cell mask. Cells 
that exhibited antigen signal higher than the selected threshold were counted as 
positive for viral infection. The ratio of virus-positive cells to total number of 
analysed cells was used to determine the percentage of infection for each well on 
the assay plates. The effect of compounds on the viral infection was assessed as 
percentage of inhibition of infection in comparison to control wells. The resultant 
cell number and percentage of infection were normalized for each assay plate. 
Analysis of dose-response curve was performed using GeneData Screener soft- 
ware applying Levenberg-Marquardt algorithm for curve-fitting strategy. The 
curve-fitting process, including individual data point exclusion, was pre-specified 
by default software settings. R? value quantified goodness of fit and fitting strategy 
was considered acceptable at R? > 0.8. 

Virus assays. All virus infections were quantified by immuno-staining using 
antibodies that recognized the relevant viral glycoproteins, as described in 
Supplementary Table 1. 

Marburg virus assay. HeLa cells were seeded at 2,000 cells per well in a 384-well 
plate, and compounds were added to the assay plates. Assay plates were transferred 
to the BSL-4 suite and infected with 1 PFU per cell MARV, which resulted in 50% 
to 70% of the cells expressing virus antigen in a 48-h period. 

Sudan virus assay. HeLa cells were seeded at 2,000 cells per well in a 384-well plate, 
and compounds were added to the assay plates. Assay plates were transferred to 
the BSL-4 suite and infected with 0.08 PFU SUDV per cell, which resulted in 50% 
to 70% of the cells expressing virus antigen in a 48-h period. 

Junin virus assay. HeLa cells were seeded at 2,000 cells per well in a 384-well plate, 
and compounds were added to the assay plates. Assay plates were transferred to 
the BSL-4 suite and infected with 0.3 PFU per cell JUNV, which resulted in ~50% 
of the cells expressing virus antigen in a 48-h period. 

Lassa fever virus assay. HeLa cells were seeded at 2,000 cells per well in a 384-well 
plate, and compounds were added to the assay plates. Assay plates were transferred 
to the BSL-4 suite and infected with 0.1 PFU per cell LASV, which resulted in >60% 
of the cells expressing virus antigen in a 48-h period. 

Middle East respiratory syndrome virus assay. African green monkey (Chlorocebus 
sp.) kidney epithelial cells (Vero E6) were seeded at 4,000 cells per well in a 384-well 
plate, and compounds were added to the assay plates. Assay plates were transferred 
to the BSL-4 suite and infected with 0.5 PFU per cell of MERS virus, which resulted 
in >70% of the cells expressing virus antigen in a 48-h period. 

Chikungunya virus assay. U2OS cells were seeded at 3,000 cells per well in a 384- 
well plate, and compounds were added to the assay plates. Assay plates were trans- 
ferred to the BSL-4 suite and infected with 0.5 PFU per cell of CHIK, which resulted 
in >80% of the cells expressing virus antigen in a 48-h period. 

Venezuelan equine encephalitis virus assay. HeLa cells were seeded at 4,000 cells 
per well in a 384-well plate, and compounds were added to the assay plates. Assay 
plates were transferred to the BSL-4 suite and infected with 0.1 PFU per cell VEEV, 
which resulted in >60% of the cells expressing virus antigen in a 20-h period. 
Cytotoxicity assays. HEp-2 (1.5 x 10° cells per well) and MT-4 (2 x 10° cells 
per well) cells were plated in 384-well plates and incubated with the appropriate 
medium containing threefold serially diluted compound ranging from 15nM to 
100,000 nM. PC-3 cells (2.5 x 10° cells per well), HepG2 cells (4 x 10° cells per 
well), hepatocytes (1 x 10° cells per well), quiescent PBMCs (1 x 10° cells per well), 
stimulated PBMCs (2 x 10° cells per well), and RPTEC cells (1 x 10° cells per well) 
were plated in 96-well plates and incubated with the appropriate medium contain- 
ing threefold serially diluted compound ranging from 15 nM to 100,000 nM. Cells 
were cultured for 4-5 days at 37 °C. Following the incubation, the cells were allowed 
to equilibrate to 25°C, and cell viability was determined by adding Cell-Titer Glo 
viability reagent. The mixture was incubated for 10 min, and the luminescence sig- 
nal was quantified using an Envision plate reader. Cell lines were not authenticated 
and were not tested for mycoplasma as part of routine use in cytotoxicity assays. 
In vitro RSV RNA synthesis assay. RNA synthesis by the RSV polymerase was 
reconstituted in vitro using purified RSV L/P complexes and an RNA oligonucle- 
otide template (Dharmacon), representing nucleotides 1-14 of the RSV leader 
promoter*!-*3 (3/- UGCGCUUUUUUACG-3’). RNA synthesis reactions were 
performed as described previously, except that the reaction mixture contained 
250M guanosine triphosphate (GTP), 101M uridine triphosphate (UTP), 101M 
cytidine triphosphate (CTP), supplemented with 10 1Ci [«-*"P]CTP, and either 
included 101M adenosine triphosphate (ATP) or no ATP. Under these conditions, 
the polymerase is able to initiate synthesis from the position 3 site of the promoter, 
but not the position 1 site. The NIP metabolite of GS-5734 was serially diluted 
in DMSO and included in each reaction mixture at concentrations of 10, 30, or 
100|.M as specified in Fig. 1f. RNA products were analysed by electrophoresis on 
a 25% polyacrylamide gel, containing 7 M urea, in Tris-taurine-EDTA buffer, and 
radiolabelled RNA products were detected by autoradiography. 
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RSV A2 polymerase inhibition assay. Transcription reactions contained 25 1g 
of crude RSV RNP complexes in 30,1L of reaction buffer (50 mM Tris-acetate 
(pH 8.0), 120mM potassium acetate, 5% glycerol, 4.5mM MgCh, 3mM DTT, 
2mM EGTA, 50,1g ml“! BSA, 2.5 U RNasin, 201.M ATP, 100.M GTP, 100)1M 
UTP, 100j.M CTP, and 1.5:Ci [a-**P]ATP (3,000 Ci mmol~!)). The radiolabelled 
nucleotide used in the transcription assay was selected to match the nucleotide 
analogue being evaluated for inhibition of RSV RNP transcription. 

To determine whether nucleotide analogues inhibited RSV RNP transcription, 
compounds were added using a six-step serial dilution in fivefold increments. 
After a 90-min incubation at 30°C, the RNP reactions were stopped with 350 il of 
Qiagen RLT lysis buffer, and the RNA was purified using a Qiagen RNeasy 96 kit. 
Purified RNA was denatured in RNA sample loading buffer at 65°C for 10 min 
and run on a 1.2% agarose/MOPS gel containing 2 M formaldehyde. The agarose 
gel was dried, exposed to a Storm phosphorimaging screen, and developed using 
a Storm phosphorimager. 

Inhibition of human RNA polymerase II. For a 2511 reaction mixture, 7.511 
1 x transcription buffer (20 mM HEPES (pH 7.2-7.5), 100 mM KCI, 0.2mM EDTA, 
0.5mM DTT, 20% glycerol), 3mM MgCl, 100 ng CMV positive or negative control 
DNA, and a mixture of ATP, GTP, CTP and UTP was pre-incubated with various 
concentrations (0-500\1M) of the inhibitor at 30°C for 5 min. The mixture con- 
tained 5-25 1M (equal to Km) of the competing **P-labelled ATP and 400|1M of 
GTP, UTP, and CTP. The reaction was started by addition of 3.5 jl of HeLa and 
extract. After 1 h of incubation at 30°C, the polymerase reaction was stopped by 
addition of 10.6 11 proteinase K mixture that contained final concentrations of 
2.5 1g ul“? proteinase K, 5% SDS, and 25mM EDTA. After incubation at 37°C for 
3-12h, 1011 of the reaction mixture was mixed with 10 1 of the loading dye (98% 
formamide, 0.1% xylene cyanol and 0.1% bromophenol blue), heated at 75°C for 
5 min, and loaded onto a 6% polyacrylamide gel (8 M urea). The gel was dried for 
45 min at 70°C and exposed to a phosphorimager screen. The full length product, 
363 nucleotide runoff RNA, was quantified using a Typhoon Trio Imager and 
Image Quant TL Software. 

Inhibition of human mitochondrial RNA polymerase. Twenty nanomolar 
POLRMT was incubated with 20nM template plasmid (pUC18-LSP) containing 
POLRMT light-strand promoter region and mitochondrial (mt) transcription 
factors TFA (100 nM) and mtTFB2 (20nM) in buffer containing 10 mM HEPES 
(pH 7.5), 20mM NaCl, 10mM DTT, 0.1 mg ml“! BSA, and 10 mM MgCl,**. The 
reaction mixture was pre-incubated to 32°C, and the reactions were initiated by 
addition of 2.5|1.M of each of the natural NTPs and 1.5,.Ci of [7*P]GTP. After incu- 
bation for 30 min at 32°C, reactions were spotted on DE81 paper and quantified. 
Molecular modelling. A homology model of RSV A2 and EBOV polymerases 
were built using the HIV reverse transcriptase X-ray crystal structure (PDB:1RTD). 
Schrodinger Release 2015-1: Prime, version 3.9 (Schrédinger, LLC), default set- 
tings with subsequent rigid body minimization and side-chain optimization. Loop 
insertions not in 1RTD of greater than 10 amino acids were not built. 

qRT- PCR for in vivo studies. For quantitative assessment of viral RNA non- 
human primate plasma samples, whole blood was collected using a K3 EDTA 
Greiner Vacuette tube (or equivalent) and sample centrifuged at 2500 (+ 200) 
relative centrifugal force for 10 +2 min. To inactivate virus, plasma was treated 
with 3 parts (300 il) TriReagent LS and samples were transferred to frozen stor- 
age (—60°C to —90°C), until removal for RNA extraction. Carrier RNA and 
QuantiFast High Concentration Internal Control (Qiagen) were spiked into the 
sample before extraction, conducted according to manufacturer's instructions. The 
viral RNA was eluted in AVE buffer. Each extracted RNA sample was tested with 
the QuantiFast Internal Control RT-PCR RNA Assay (Qiagen) to evaluate the yield 
of the spiked-in QuantiFast High Concentration Internal Control. If the internal 
control amplified within manufacturer-designated ranges, further quantitative 
analysis of the viral target was performed. RT-PCR was conducted using an ABI 
7500 Fast Dx using primers specific to EBOV glycoprotein. Samples were run in trip- 
licate using a 511 template volume. For quantitative assessments, the average of the 
triplicate genomic equivalents (GE) per reaction were determined and multiplied 
by 800 to obtain GE ml plasma. Standard curves were generated using synthetic 
RNA. The limits of quantification for this assay are 8.0 x 10*— 8.0 x 10!°GE ml"! 
of plasma. Acceptance criteria for positive template control (PTC), negative tem- 
plate control (NTC), negative extraction control (NEC), and positive extraction 
control (PEC) are specified by standard operating procedure. For qualitative assess- 
ments, the limit of detection (LOD) was defined as C, 38.07, based on method 
validation testing. An animal was considered to have tested positive for detection 
of EBOV RNA when a minimum of 2 of 3 replicates were designated as ‘positive 
and PTC, NTC, and NEC controls met specified method-acceptance criteria. 
A sample was designated as ‘positive’ when the C, value was <LOD C,. 
Pharmacokinetic evaluations. Three uninfected male rhesus monkeys (Macaca 
mulatta) were used for the pharmacokinetic study. GS-5734 was formulated 
in solution at 5mg ml“! with 12% sulfobutylether-3-cyclodextrin in water, 


pH 3.5-4.0, and 2 ml kg! was administered by slow bolus (approximately 1 min) 
for a final dose of 10 mgkg 1. Blood samples for plasma and PBMCs were col- 
lected from a femoral vein/artery and were taken from each monkey over a 24-h 
period. Plasma samples were obtained at predose and at 0.083, 0.25, 0.5, 1, 2, 4, 
8, and 24h postdose. PBMC samples were obtained at 2, 4, 8, and 24h. Blood 
samples for plasma were collected into chilled collection tubes containing sodium 
fluoride/potassium oxalate as the anticoagulant and were immediately placed on 
wet ice, followed by centrifugation to obtain plasma. Blood samples for PBMC 
isolation were collected at room temperature into CPT vacutainer tubes contain- 
ing sodium heparin for isolation. Plasma and isolated PBMC samples were frozen 
immediately and stored at <60°C until analysed. 

For plasma analysis, an aliquot of 2511 of each plasma sample was treated with 

100 l of 90% methanol and acetonitrile mixture (1:1, v:v) and 10% water with 
20 nM 5-(2-aminopropyl)indole as an internal standard. Then, 100,11 of samples 
were filtered through an Agilent Captiva 96 well 0.2\1m filter plate. Filtered sam- 
ples were dried down completely for approximately 20 min and reconstituted with 
1% acetonitrile and 99% water with 0.01% formic acid. An aliquot of 10,11 was 
injected for LC-MS/MS using a HTC Pal autosampler. Analytes were separated 
on a Phenomenex Synergi Hydro-RP 30A column (75 x 2.0mM, 4.0j1m) using a 
Waters Acquity ultra performance LC (Waters Corporation, Milford, MA, USA), 
a flow rate of 0.26 ml min“, and a gradient from Mobile phase A containing 0.2% 
formic acid in 99% water and 1% acetonitrile to mobile phase B containing 0.2% 
formic acid in 95% acetonitrile and 5% water over 4.5 min. For MS/MS analysis, 
we used a Waters Xevo TQ-S in positive multiple reaction monitoring mode using 
an electrospray probe. Plasma concentrations of GS-734, alanine metabolite and 
Nuc were determined using an 8-point calibration curve spanning a concentra- 
tion range of over three orders of magnitude. Quality control samples were run at 
the beginning and end of the run to ensure accuracy and precision within 20%. 
Intracellular metabolites in PBMCs were quantified by LC-MS/MS as described 
above for in vitro activation studies. 
Radiolabelled tissue distribution. Six cynomolgus monkeys (Macaca fascicularis) 
were administered a single dose of [!4C]GS-5734 at 10mgkg ! (25.Ci kg”) by 
intravenous administration (slow bolus). Tissues were collected from three animals 
at 4 and 168 h postdose. The tissues were excised, rinsed with saline, blotted dry, 
weighed, and placed on wet ice. Tissues (testes, epididymis, eyes and brain; follow- 
ing homogenization) and plasma were analysed by liquid scintillation counting. 
Concentrations were converted to ng equivalents of GS-5734 per gram of sample. 
In vivo efficacy. Rhesus monkeys (Macaca mulatta) were challenged on day 0 
by intramuscular injection with a target dose of 1,000 PFU of EBOV Kikwit 
(Ebola virus H. sapiens-tc/COD/1995/Kikwit), which was derived from a clinical 
specimen obtained during an outbreak occurring in the Democratic Republic of 
the Congo (formerly Zaire) in 1995. Challenge virus was propagated from the 
clinical specimen using cultured cells (Vero or Vero E6) for a total of four pas- 
sages. Animals (3-6 years old) were randomly assigned to experimental treatment 
groups, stratified by sex (with equal number of males and females per group) and 
balanced by body weight, using SAS statistical software. Study personnel respon- 
sible for assessing animal health (including euthanasia assessment) and admin- 
istering treatments were experimentally blinded to group assignment of animals. 
The primary endpoint for efficacy studies was survival to day 28 following virus 
challenge. GS-5734 was formulated at Gilead Sciences in water with 12% sulfobu- 
tylether-3-cyclodextrin (SBE- 8-CD), pH adjusted to 3.0 using HCl. Formulations 
were administered to anaesthetized animals by bolus intravenous injection at a rate 
of approximately 1 min per dose in the right or left saphenous vein. The volume 
of all vehicle or GS-5734 injections was 2.0 ml kg! body weight. Animals 
were anaesthetized using intramuscular injection of a solution containing ketamine 
(100mg ml!) and acepromazine (10 mg ml ') at 0.1 ml kg”! body weight. 

Animals were observed at least twice daily to monitor for disease signs, and 
animals that survived to day 28 were deemed to be protected. Study personnel 
alleviated unnecessary suffering of infected animals by euthanizing clinically mor- 
ibund animals. The criteria used as the basis for euthanasia of moribund animals 
were defined before study initiation and included magnitude of responsiveness, 
reduced body temperature, and/or specified alterations to serum chemistry param- 
eters*’. Serum chemistry was analysed using a Vitros 350 Chemistry System (Ortho 
Clinical Diagnostics), and coagulation parameters were evaluated using a Sysmex 
CA-1500 coagulation analyser (Siemens Healthcare Diagnostics). Haematology 
analysis was conducted using a Siemens Advia 120 Hematology System with mul- 
tispecies software (Siemens Healthcare Diagnostics). On days in which GS-5734 
or vehicle dosing were scheduled with blood sample collection for clinical pathol- 
ogy or viraemia analysis, blood samples were collected immediately before dose 
administration. 

Viral genomic sequence analysis. Analysis of viral genomic sequence was con- 
ducted with the purpose of evaluating genomic sequence change patterns consist- 
ent with development of resistance against GS-5734. Attempts were made to obtain 
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sequence of RNA-dependent RNA polymerase gene (L) of Ebola virus from all 
EBOV-RNA-positive rhesus monkey plasma samples obtained during the efficacy 
studies. Deep sequencing screening of the L gene was completed using previously 
described methods**3”. Mutations and large subclonal events (>10% of popula- 
tion) were reviewed for: non-synonymous substitutions in treated animals that 
succumbed to infection, any substitution enriched in treated populations regardless 
of survival status, and clusters of substitutions in any treated animal. 

cDNA synthesis was performed using Superscript III First-Strand Synthesis 
System (Invitrogen). cDNA was amplified with Phusion Hot Start Flex DNA 
Polymerase (New England Biolabs) using overlapping 1,500-kb amplicons (primer 
information available upon request). After pooling and purification with AMPure 
XP Reagent (Beckman Coulter), PCR products were fragmented using the Covaris 
S2 instrument (Covaris). Libraries were prepared with the Illumina TruSeq DNA 
Sample Preparation kit (Illumina) on the Caliper ScicloneG3 Liquid Handling 
Station (PerkinElmer). After measurement by real-time PCR with the KAPA qPCR 
Kit (Kapa Biosystems), libraries were diluted to 4 or 10nM. Cluster amplification 
was performed on the Illumina cBot, and libraries were sequenced on the Illumina 
Nextseq or Illumina HiSeq 2500 using the 150 or 100 bp paired-end format. Viral 
assemblies were completed in DNAStar Lasergene nGen. Amplification primer 
removal, quality trimming, and trim-to-mer were performed on reads with a min- 
imum similarity to the reference of 93% (four-base mismatch). A target depth of 
1,200 is sought and single nucleotide polymorphisms at positions with fewer than 
200 read depth were removed from the analysis. A consensus change was defined as 
a change relative to the deposited EBOV sequence (GenBank: AY354458) present 
in >50% of the population. Below that threshold, single nucleotide polymorphisms 
were considered subclonal substitutions and part of a minority subpopulation of 
the virus. Consensus sequence for the region covered in this screening is available 
in GenBank and the accession numbers are provided in Extended Data Table 5. 
Animal care. Pharmacokinetic and radiolabelled tissue distribution studies in 
uninfected cynomolgus and rhesus macaques were conducted at Covance, Inc. 
Protocols were reviewed by an Institutional Animal Care and Use Committee 
(IACUC) at Covance. Efficacy experiments involving EBOV were performed 
in animal biosafety level 4 (ABSL-4) at USAMRIID. Research was conducted 
under an Institutional Animal Care and Use Committee approved protocol in 
compliance with the Animal Welfare Act, PHS Policy, and other federal statutes 
and regulations relating to animals and experiments involving animals. The 
facilities where this research was conducted are accredited by the Association 
for Assessment and Accreditation of Laboratory Animal Care, International 
and strictly adhere to principles stated in the Guide for the Care and Use of 
Laboratory Animals, National Research Council, 2011 (National Academies 
Press, Washington, DC). 


LETTER 


Statistics. Combined vehicle group from part 1 and 2 (n=6 animals total) was 
used as control group in all statistical comparisons of GS-5734 efficacy evalua- 
tions. The impact of GS-5734 treatment on the survival rates was estimated using 
Kaplan-Meier method and analysed by log-rank analysis using Dunnett-Hsu pro- 
cedure to adjust for multiple comparisons. The effect on systemic viral RNA levels 
was assessed by analysis of variance (ANOVA), comparing each GS-5734 treatment 
group with vehicle group using Dunnett’s test to adjust for multiple comparisons. 
Wilcoxon rank-sum test without adjustment for multiple comparisons was used 
to compare the effects of GS-5734 treatment on haematology, coagulation, and 
clinical chemistry parameters. All data met the statistical assumptions of the test 
performed. 
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Extended Data Figure 1 | Intracellular metabolism of GS-5734. 

a, Intracellular metabolite profile in human macrophages. Following a 2-h 
pulse incubation (black bar at top of y axis) of human monocyte-derived 
macrophages with 1 1M GS-5734 (mean + s.d., from three donors). 
GS-5734 is rapidly metabolized and not detected in cells. Transient 
exposure to the intermediate alanine metabolite (Ala-Met) is observed, 
followed by persistent Nuc exposure. The pharmacologically active NTP 
is formed quickly, achieving a maximum intracellular concentration at 4h 
and persisted with a half-life of 16 + 1h in the three donors. Intracellular 
concentration was estimated on the basis of an intracellular volume of 

1 pl per cell. b, Efficiency of GS-5734 activation in human and rhesus 
cells in vitro. Intracellular NTP concentrations formed in human and 
rhesus PBMCs, monocytes, and monocyte-derived macrophages during 

a 2-h incubation with 11M GS-5734 (results are the mean + s.d. of two 
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(PBMC and monocyte) to six (macrophage) independent experiments 
performed with cells from different donors). Intracellular concentrations 
were estimated on the basis of a cell volume of 0.2 pl per cell for PBMCs 
and monocytes and 1 pl per cell for macrophages. c, Intracellular NTP 
levels required for inhibition of EBOV replication in cell culture. The 
mixture of GS-5734 and its diastereomer on phosphorous was incubated 
continuously for 72h at 141M and levels of intracellular NTP were 
determined (results are the average of duplicate incubations performed 

in each cell type; two independent studies were performed in HMVEC 
isolated from different donors). The corresponding EBOV ECs values for 
the prodrug diastereomeric mixture were 100, 184, and 121 nM in human 
macrophages, HeLa, and HMVEC, respectively, suggesting that an average 
intracellular NTP concentration of approximately 51M is required for 50% 
inhibition of EBOV in vitro. 
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Extended Data Figure 2 | Inhibition of EBOV Makona replication by infectious dose that caused 50% infection (TCIDs0) was determined. 
GS-5734. Huh-7 cells infected with wild-type EBOV (Makona) were Independently, total RNA was extracted from infected cells and EBOV 
incubated for 3 days in the presence of serial dilutions of GS-5734. The RNA levels were quantified using a nucleoprotein (NP) gene-specific qRT- 
amount of infectious virus produced was quantified by endpoint dilution PCR. Values represent mean + s.d. of logio-transformed values, 


assay of culture media on fresh Vero cell monolayers and the tissue culture = 4 replicates. 
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Extended Data Figure 3 | Homology model of RSV A2 (cyan) and EBOV (coral) polymerase based on HIV reverse transcriptase (PDB: 1RTD) with 
NTP (green and red representing the nucleoside and triphosphate portion, respectively). 
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Extended Data Figure 4 | Clinical signs of disease in individual rhesus daily scores were converted to colour code, with darker colours indicative 
monkeys exposed to Ebola virus. Animals were observed multiple times of more severe disease signs. The schematic was truncated to emphasize 
each day and were subjectively assigned a clinical disease score ranging clinical scores during the acute disease phase, and none of the animals 
from 0 to 5 based on responsiveness, posture, and activity. Maximum exhibited clinical disease signs outside of the times that are shown. 
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Extended Data Figure 5 | Amelioration of EVD clinical pathology by 
GS-5734 in rhesus monkeys. a—c, Group mean (n = 6 per group) values 
of p-dimer (a), activated partial thromboplastin time (b), and lipase (c). 
Black (open symbols), vehicle; red, 3 mgkg~' dO; green, 3mgkg ! d2; 


Time after challenge (d) 
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blue, 10/3 mg kg~! d2; orange, 10/3 mg kg”! d3; black (closed symbols), 
10mgkg! d3. Error bars omitted for clarity; x axes truncated at day 15. 
*P <0.05 for comparison of mean change from d0 of vehicle and 
10mgkg~! d3 groups at day 7. 
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Extended Data Table 1 | /n vitro cytotoxicity of GS-5734 and Nuc in human cell lines and primary cells 


Human cell lines 
HEp-2 
HepG2 
PC-3 
MT-4 
Human primary cells 
Hepatocytes 
Renal proximal tubular epithelial cells (RPTEC) 
Quiescent PBMCs 
Stimulated PBMCs 


GS-5734 


6.0+1.5 
3.7 40.2 
8.9+1.6 
1.7404 


2.54 0.6 
12.9+6.2 
> 20 
14.8458 


CC50 (uM)* 


Nuc 


> 100 
> 100 
> 100 
69.3 + 25.7 


> 100 
> 100 
> 100 
> 100 
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Puromycin 


0.53 + 0.10 
0.73 + 0.01 
0.52 +0.11 
0.12 + 0.03 


1.5+0.6 
1.140.3 
6.8 + 1.4 
1640.2 


*Drug concentrations reducing the cell viability by 50% (CCso) are presented. All CCso values represent the mean-+s.d. of at least two independent experiments. Puromycin was included 


in experiments as a positive control for cytotoxicity. 
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Extended Data Table 2 | Individual plasma viral RNA (logio(copies ml—1)) 


Plasma Viral RNA on Study Day 


Treatment ee 
Description Animal # 0 2 3 4 5 6 vi 8 9 10 12 14 18 21/22 28/29 
Vehicle 
1 ND ND ND 6.6 9.0 - 10.0 - 9.5 
2 ND ND ND 5.9 8.9 - 9.8 
3 ND ND 6.5 8.2 8.5 - 8.6 
4 ND ND DET 66 8.4 - 8.4 - 8.1 
5 ND ND 5.8 88 10.0 10.3 
6 ND ND 5.4 74 9.4 - 9.2 8.7 
GS-5734 3 mg/kg DO 
1 ND ND ND DET 6.0 - 7.3 - 74 7.3 
2 ND ND ND ND 4.9 - 6.8 - 9.9 
3 ND ND ND DET 5.5 - 6.5 - 5.8 - DET ND - ND ND 
4 ND ND ND ND 4.9 - 5.8 - 5.6 - DET ND - ND ND 
5 ND ND ND DET 57 - 9.1 - 9.1 8.6 
6 ND ND DET 7.2 9.3 - 
GS-5734 3 mg/kg D2 
1 ND ND ND DET 6.0 - 6.6 - 6.8 - 6.9 6.0 - ND ND 
2 ND ND ND 6.4 7.6 - 7.1 - 6.8 - DET ND ND 
3 ND ND ND DET 66 - 7.5 - 8.2 - 5.2 ND - ND ND 
4 ND ND ND 5.1 7.0 - 74 - 6.6 - 
5 ND ND 4.9 6.9 8.1 - 8.1 - 6.6 - 5.2 ND - ND ND 
6 ND ND ND DET 5.5 - 6.7 - 5.9 - ND ND - ND ND 
GS-5734 10/3 mg/kg D2 
1 ND DET 5.2 5.8 7.9 - 8.2 - 8.0 
2 ND ND ND ND DET - DET - ND - ND ND - ND ND 
3 ND ND ND DET 6.1 - 7.8 - 7.5 
4 ND ND ND 4.9 5.5 - 6.1 - DET - ND ND - ND ND 
5 ND ND ND ND DET - 8.2 - 8.2 
6 ND ND ND §.3 7.1 - 6.9 8.1 
GS-5734 10/3 mg/kg D3 
1 ND ND DET 54 6.5 - 6.5 - 5.0 - ND ND - ND ND 
2 ND ND 5.3 6.2 ital - 6.8 - 6.0 - ND ND - ND ND 
3 ND ND DET 5.1 6.9 - 7.0 - 7.0 - 6.7 ND - ND ND 
4 ND ND ND DET 7.0 - 8.1 - 8.3 - 6.2 DET - ND ND 
5 ND ND ND ND DET - ND - 5:5 - ND ND - ND ND 
6 ND ND DET 58 6.6 - 6.2 - 6.8 - ND ND - ND ND 
GS-5734 10 mg/kg D3 
1 ND ND DET 57 6.1 - 6.0 - DET - ND ND - ND ND 
2 ND ND ND DET DET - DET - 5.6 - ND ND - ND ND 
3 ND ND ND 6.7. DET - ND - ND - ND ND - ND ND 
4 ND ND ND ND DET - ND - ND - ND ND - ND ND 
5 ND ND ND ND ND - ND - ND - ND ND - ND ND 
6 ND ND DET 56 6.9 - 5.5 - 5.9 - ND ND - ND ND 


-, sample not collected (days 6, 8, 10, and 18 were unscheduled samplings of succumbed animals only). DET, detectable, but below the lower limit of quantitation (8.0 x 10* copies ml 1); ND, not 
detected, that is, below limit of detection. 
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Extended Data Table 3 | Summary and statistical analysis of plasma viral RNA 


Day Vehicle 
3. 4.77 

4 7.24 

5 9.05 

7 9.19 

9 8.76 

12 - 


Plasma Viral RNA, mean logio copies ml (P value*) 


GS-5734 GS-5734 GS-5734 

3 mg/kg 3 mg/kg 10/3 mg/kg 

DO D2 D2 

3.32 (0.019) 3.32 (0.020) 3.36 (0.023) 
4.66 (0.001) 5.52 (0.024) 4.49 (0.001) 

6.04 (<0.001) 6.82 (0.002) 6.07 (<0.001) 

7.09 (0.013) 7.24 (0.015) 7.00 (0.007) 

7.55 (0.351) 6.82 (0.132) 6.30 (0.065) 

4.90 (NA) 5.05 (NA) 3.00 (NA) 


GS-5734 
10/3 mg/kg 
D3 

4.33 (0.454) 
5.06 (0.005) 
6.52 (0.001) 
6.28 (0.001) 
6.42 (0.072) 
4.14 (NA) 
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GS-5734 

10 mg/kg 

D3 

3.63 (0.062) 
4.81 (0.002) 
5.12 (<0.001) 
4.24 (<0.001) 
4.22 (0.001) 


3.00 (NA) 


*P values are from ANOVA comparing each GS-5734 treatment group with vehicle group using Dunnett’s test to adjust for multiple comparisons (n=6 animals per group, PCR sample assays 
performed in triplicate). EBOV RNA values reported as ‘<LOD’ were substituted as 10° RNA copies ml~!, and values reported as ‘>LOD, <LLOQ’ were substituted as LLOQ of 8.0 x 104 RNA 
copies ml-! for computation purpose. Statistically significant P values (P< 0.05) are highlighted in bold. NA, not applicable, owing to no survivors in vehicle group. 
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Extended Data Table 4 | Statistical summary of selected clinical pathology parameters 


Mean Change from Baseline, Day 7 (P value compared with combined vehicle group*) 


GS-5734 GS-5734 GS-5734 GS-5734 GS-5734 

3 mg/kg 3 mg/kg 10/3 mg/kg 10/3 mg/kg 10 mg/kg 
Parameter Vehicle DO D2 D2 D3 D3 
Platelet count (10°/ uL) 279 -118 (0.012) -202 (0.055) -155 (0.055) “98 (0.014) -65 (0.008) 
PT (sec) 5.0 1.3 (0.01) 3.2 (0.27) 1.6 (0.06) 2.5 (0.02) 1.7 (0.01) 
APTT (sec) 47.7 12.6 (0.012) 19.3 (0.014) 14.3 (0.008) 15.2 (0.008) 9.8 (0.008) 
Fibrinogen (mg/dL) 2.5 -4.7 (0.012) -5.5 (0.008) -5.0 (0.014) -5.0 (0.014) -4.4 (0.008) 
TT (sec) 50.2 -1.4 (0.012) 2.6 (0.008) 1.4 (0.008) 3.6 (0.008) -1.5 (0.008) 
Antithrombin (%) -39.6 -6.1 (0.012) -10.3 (0.008) -7.9 (0.008) 5.6 (0.008) 3.1 (0.008) 
D-dimer (mg/dL) 1.15 0.13 (0.012) -0.09 (0.008) 0.11 (0.008) -0.12 (0.005) 0.02 (0.007) 
ALT (U/L) 340 14 (0.012) 24 (0.008) 116 (0.083) 28 (0.008) 32 (0.008) 
AST (U/L) 1425 273 (0.014) 206 (0.014) 90 (0.020) 157 (0.014) 36 (0.014) 
ALP (U/L) 1238 -69 (0.012) -74 (0.008) 7 (0.008) 8 (0.008) -66 (0.008) 
CRK (U/L) 5420 1277 (0.020) 1002 (0.014) 841 (0.014) 682 (0.014) 96 (0.014) 
GGT (U/L) 146 -12 (0.012) -13 (0.008) -2 (0.008) 1 (0.008) -12 (0.008) 
LDH (U/L) 8391 1006 (0.020) 2263 (0.014) 2358 (0.014) 2439 (0.014) 352 (0.014) 
Bilirubin (mg/dL) 1.3 0 (0.071) 0 (0.048) 0 (0.048) 0 (0.048) 0 (0.048) 
BUN (mg/dL) 60 0 (0.021) 1 (0.028) 2 (0.036) 5 (0.055) 1 (0.021) 
Creatinine (mg/dL) 1.80 0.12 (0.015) 0.27 (0.017) 0.18 (0.014) 0.27 (0.014) 0.43 (0.066) 
Lipase (U/L) 205 17 (0.14) 34 (0.12) -17 (0.055) 36 (0.12) -12 (0.036) 
Triglycerides (mg/dL) 538 -7 (0.012) 52 (0.008) 63 (0.008) 420 (0.083) -6 (0.008) 
CRP (mg/dL) 48.6 48.8 (0.83) 43.8 (1.0) 41.2 (0.31) 35.3 (0.24) 13.2 (0.008) 
Albumin (g/dL) -1.5 -0.8 (0.012) -1.2 (0.170) -0.8 (0.036) -0.8 (0.022) -0.7 (0.008) 
Total protein (mg/dL) -1.1 -0.5 (0.034) -0.9 (0.27) -0.6 (0.17) -0.4 (0.035) -0.4 (0.008) 
Chloride (mEq/dL) -14 -3 (0.011) -5 (0.008) -6 (0.013) -6 (0.067) 0 (0.008) 
Phosphate (mEq/dL) 0.2 -2.1 (0.036) -2.5 (0.021) -1.5 (0.12) -1.0 (0.65) -0.3 (0.93) 
Sodium (mg/dL) -17 -8 (0.019) -10 (0.042) -9 (0.054) -7 (0.042) -5 (0.014) 


*Wilcoxon rank-sum test without adjustment for multiple comparisons using a combined vehicle group as a control group for the analysis (n=6 animals per group). 
Statistically significant P values (P< 0.05) are highlighted in bold. 

ALP, alkaline phosphatase; ALT, alanine aminotransferase; APTT, activated partial thromboplastin time; AST, aspartate aminotransferase; BUN, blood urea nitrogen; 
CRK, creatine kinase; CRP, C-reactive protein; GGT, gamma glutamyl transferase; LDH, lactate dehydrogenase; PT, prothrombin time; TT, thrombin time. 
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Extended Data Table 5 | L gene deep sequencing screening sample description and metrics 


Treatment 
Description 


Vehicle 


3 mg/kg DO 
3 mg/kg DO 


3 mg/kg D2 


3 mg/kg D2 


3 mg/kg D2 


3 mg/kg D2 


10/3 mg/kg 
D2 


10/3 mg/kg 
D3 
10/3 mg/kg 
D3 


10 mg/kg 


Animal 
# 


6 


Survival 
Day Outcome 


7 Deceased 


7 Deceased 
5 Deceased 


if Survived 


9 Survived 


5 Survived 


9 Deceased 


5 Deceased 


9 Survived 


7 Survived 


5 Survived 


Genbank 
Accession 
Number 


KU321182 


KU321189 


KU321084 


KU321152 


KU321162 


KU321165 


KU321088 


KU321098 


KU321149 


KU321172 


KU321154 


Polymerase 


% L Gene Amino Acid and 
Coverage Codon Changes 


100.0 


100.0 


100.0 


100.0 


100.0 


100.0 


100.0 


81.2 


100.0 


99.3 


86.5 


W (TGG) @191 
(T+G) 


E (GAA) @2173 
E (GAg) 

G (GGT) @1160 
G (GGc) 

W (TGG) @191 
(T+G) 


Q (CAA) @805 
(+AA) 


W (TGG) @191 
(T+G) 


W (TGG) @191 
(T+G) 


Q (CAA) @1755 
Q (CAG) 


W (TGG) @191 
(T+G) 


K (AAG) @341 K 
(AAa) 

| (ATC) @348 S 
(AgC) 

K (AAA) @1387 
K (AAg) 

F (TTT) @1827 
(TT-) 

K (AAG) @659 N 
(AAt) 

Q (CAA) @805 
(+AA) 


% of 
Population 
with 
Change 


21.9 


99.5 
44.5 


21.8 


26.6 


13.8 


23.4 


99.0 


13.9 


24.2 
26.7 
22.3 
28.3 
20.6 
28.0 


40.2 


iSNV Description 


Volatile sub clonal substitution observed 
in most samples in all treatment groups 

and controls >2% of population. Causes 
a frameshifting insertion at the end of a 

large homopolymer. 


Synonymous substitution unlikely 
selection pressure. 


Synonymous substitution unlikely 
selection pressure. 


Volatile sub clonal substitution observed 
in most samples in all treatment groups 

and controls >2% of population. Causes 
a frameshifting insertion at the end of a 

large homopolymer. 


Volatile sub clonal substitution observed 
in most samples in all treatment groups 

and controls >2% of population. Causes 
a frameshifting insertion at the end of a 

large homopolymer. 


Volatile sub clonal substitution observed 
in most samples in all treatment groups 
and controls > 2% of population. Causes 
a frameshifting insertion at the end of a 
large homopolymer. 


Volatile sub clonal substitution observed 
in most samples in all treatment groups 

and controls >2% of population. Causes 
a frameshifting insertion at the end of a 

large homopolymer. 


Synonymous mutation unlikely selection 
pressure. 


Volatile sub clonal substitution observed 
in most samples in all treatment groups 

and controls >2% of population. Causes 
a frameshifting insertion at the end of a 

large homopolymer. 


Non-coding substitution, unlikely 
selection pressure. 


Synonymous substitution unlikely 
selection pressure. 


Non-synonymous substitution tolerated 
by survivor. 


Synonymous substitution unlikely 
selection pressure. 


Indel causes a frameshifting deletion at 
the end of a large homopolymer region. 


Non-synonymous substitution tolerated 
by survivor. 


Volatile sub clonal substitution observed 
in most samples in all treatment groups 

and controls >2% of population. Causes 
a frameshifting insertion at the end of a 

large homopolymer. 


Genomic sequence analysis was conducted on all samples containing quantifiable concentrations of viral RNA (that is, >LLOQ as assessed by quantitative real-time PCR). 
Sequences for which no change (defined as >2% of the population) was noted from the reference sequence are not shown. 
iSNV, intra-host Single Nucleotide Variant. 
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Hepatitis B virus X protein identifies the Smc5/6 
complex as a host restriction factor 


Adrien Decorsiére!*, Henrik Mueller!+*, Pieter C. van Breugel'+*, Fabien Abdul!*, Laetitia Gerossier?, Rudolf K. Beran’, 
Christine M. Livingston’, Congrong Niu’, Simon P. Fletcher’, Olivier Hantz? & Michel Strubin! 


Chronic hepatitis B virus infection is a leading cause of cirrhosis 
and liver cancer’. Hepatitis B virus encodes the regulatory HBx 
protein whose primary role is to promote transcription of the 
viral genome, which persists as an extrachromosomal DNA circle 
in infected cells*-°. HBx accomplishes this task by an unusual 
mechanism, enhancing transcription only from extrachromosomal 
DNA templates®. Here we show that HBx achieves this by hijacking 
the cellular DDB1-containing E3 ubiquitin ligase to target the 
‘structural maintenance of chromosomes’ (Smc) complex Smc5/6 
for degradation. Blocking this event inhibits the stimulatory 
effect of HBx both on extrachromosomal reporter genes and on 
hepatitis B virus transcription. Conversely, silencing the Smc5/6 
complex enhances extrachromosomal reporter gene transcription 
in the absence of HBx, restores replication of an HBx-deficient 
hepatitis B virus, and rescues wild-type hepatitis B virus in a 
DDB1-knockdown background. The Smc5/6 complex associates 
with extrachromosomal reporters and the hepatitis B virus genome, 
suggesting a direct mechanism of transcriptional inhibition. These 
results uncover a novel role for the Smc5/6 complex as a restriction 
factor selectively blocking extrachromosomal DNA transcription. 
By destroying this complex, HBx relieves the inhibition to allow 
productive hepatitis B virus gene expression. 

HBx stimulates transcription of the hepatitis B virus (HBV) genome 
as well as of any reporter gene provided the DNA template remains 
extrachromosomal. When the template is integrated into the chro- 
mosome, the HBx effect is lost*®. Previous studies suggested that this 
unusual property requires HBx to bind the DDB1 subunit of the DDB1- 
containing E3 ubiquitin ligase to target an unknown host cell factor for 
ubiquitin-mediated degradation®’. To identify the HBx target(s), we 
employed a tandem affinity purification (TAP) strategy. As baits, we 
used wild-type HBx, a DDB1-binding-defective HBx(R96E) mutant’, 
the woodchuck hepatitis virus HBx counterpart, WHx, that exhibits 
similar properties*!°, and the unrelated paramyxovirus SV5-V protein. 
SV5-V binds DDB1 in a manner similar to HBx to target STAT 1 
for ubiquitin-mediated degradation®!!!”. DDB1 was the most abun- 
dant protein species co-purifying with all viral proteins, except the 
HBx(R96E) mutant (Extended Data Fig. 1a, b). Other components 
of the E3 ligase were also identified. This provides further evidence 
that HBx predominantly exists in association with the DDB1 E3 ligase 
complex®!*-!4, However, no protein behaving as a potential HBx 
substrate was recovered. Similarly, STAT1 did not co-purify with SV5-V 
under these conditions (Extended Data Fig. 1b). 

We therefore turned to an alternative strategy. We postulated that 
HBx may detectably interact with its substrate only when bound to 
DDB1 and when substrate ubiquitination is blocked. With this in 
mind, we took advantage of our previous finding that a covalent link 


between HBx and DDB1 forces interaction between the two proteins, 
thereby preventing HBx binding to endogenous DDB1®*!. We fused 
TAP-tagged HBx to wild-type DDB1 or toa DDB1 mutant that could 
not incorporate into the E3 ligase complex (Fig. 1a). The expectation 
was that fusion of HBx to the DDB1 mutant would prevent ubiq- 
uitination of the substrate and thereby stabilize the HBx-substrate 
interaction. A proof-of-principle experiment demonstrated that this 
did indeed occur with SV5-V and its target STAT1 (Extended Data 
Fig. 1c, d). Intriguingly, all six subunits of the Smc5/6 complex 
(Extended Data Fig. le), which plays a broad role in DNA repair and 
chromatin organization'>'’, were only recovered with the HBx-DDB1 
mutant fusion (Fig. 1b and Supplementary Table 1). The selective recov- 
ery of this complex with the HBx-DDB1 mutant, and its lack of inter- 
action with the corresponding SV5-V-DDB1 construct, was confirmed 
by co-immunoprecipitation experiments (Fig. 1c). Hence, the Smc5/6 


a c 
Strategy HBx Svsv 
ys Xp DDB1 DDB1 
x _ WT Mut. WT Mut. 
os Fusions 
DDB1 
rene is Smc5 
Co Ce | —— a 
(1/10) 
covet ink 7 SSSs os 
WT Mutant ~ 
Cov. Cov. r 
= aia ° ~—-—= 
HBx 2 | 18 | 3 | 28 L 
DDB1 39 | 38 | 34 | 35 
CUL4A 2/a]-|- 
SMC complex oo 
Sme5 - | - | 26 | 25 
Smc6 - | - | 17 | 22 
Nset = = 3 12 
Nse2 = = 7 23 
Nse3 - | - | 44 | 45 
Nse4 ad = 2 6 


Figure 1 | Identification of the Smc5/6 complex as an HBx interacting 
partner. a, Strategy. See text for description. b, The HBx-DDB1 fusions 
and associated proteins were isolated by TAP and subjected to mass 
spectrometric analysis (see Methods). The number of unique peptides (Pep.) 
and percentage of total protein sequence covered (Cov. (%)) are indicated. 
Nse1-4 are subunits of the Smc5/6 complex. WT, wild type. c, HBx and 
SV5-V fused to the indicated DDB1 were purified from transfected HepG2 
cells. The amount recovered (IP) and the presence of Smc5, Smc6 and 
STAT1 (Co-IP) were assessed by western blot. The fusion proteins were 
detected using anti-Flag antibodies; expression was too low for detection in 
crude extracts (input). For gel source data, see Supplementary Fig. 1. 
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Figure 2 | HBx stimulates reporter gene activity by degrading Smc5/6 
to prevent its binding to episomal DNA. a, HepG2 cells were transfected 
with a luciferase reporter gene driven by the HBV Enhancer I (Enh1) 

and then transduced with lentiviral vectors expressing green fluorescent 
protein (GFP) or the indicated GFP-tagged viral proteins. Luciferase assay 
and western blot were performed 6 days after transfection. Luciferase 
activity is relative to GFP, which was set to 1. *P < 0.05 by paired analysis 
of variance (ANOVA) relative to GFP control. NS, not significant 

(P > 0.05). b, Same as in a except cells were treated or not with 51M 
MLN4924 starting 6h before transduction. Analysis was performed 3 days 
later. *P < 0.05 by t-test relative to matched no MLN4924 control. c, Same 
as in a but with cells expressing a doxycycline (dox)-inducible wild-type 
or mutant (mut) HBx-DDB1 fusion. Analysis performed as in a. See also 
Extended Data Fig. 2e. d, Activity of a transiently transfected (episomal) 
HBV Enh1-driven firefly reporter (HBV-FLuc) and a chromosomally 
integrated EFla promoter-driven Renilla reporter (EFla-RLuc) in 


complex behaves as STAT 1 in the SV5-V experiment, strongly suggest- 
ing that the complex is a substrate of HBx. 

This hypothesis and the key role of the Smc5/6 complex in mediat- 
ing the HBx stimulatory activity were confirmed as follows. Western 
blot analysis revealed that Smc5 and Smcé6 levels were strongly 
decreased in HepG2 cells expressing HBx (Fig. 2a). The same was 
observed with the woodchuck WHx, suggesting a conserved mech- 
anism (Fig. 2a). This correlated with the stimulatory effects of HBx 
and WHx on the activity ofa transiently transfected luciferase reporter 
construct, which, like the HBV genome, remains extrachromosomal®. 
In contrast, SV5-V triggered degradation of STAT1 but had no effect 
on Smc5 or Smc6 levels or reporter gene expression (Fig. 2a). HBx- 
mediated depletion of Smcé6 was rapid, occurring concomitantly with 
or preceding the HBx-mediated increase in reporter gene expression 
(Extended Data Fig. 2a), and was inhibited by exposure to the protea- 
some inhibitor MG132 (Extended Data Fig. 2b), consistent with a role 
of proteasomes in HBx activity'®. HBx and WHx had no impact on 
Smc5 and Smc6 messenger RNA levels, excluding an effect on tran- 
scription or mRNA stability (Extended Data Fig. 2c). Furthermore, 
pre-treatment of cells with MLN4924, an inhibitor of the E3 ubiq- 
uitin ligase family to which the DDB1 E3 ligase belongs!””°, abro- 
gated the effects of HBx and WHx both on reporter gene activity 


shSmc5 


+ - + 
shSmc6 
HepGz? cells transduced with GFP or GFP-HBx and expressing (+) or 
not (—) the indicated doxycycline-inducible shRNA. Analysis performed 
after 6 days of culture. *P < 0.05 by t-test relative to matched HBV-FLuc 
episomal no doxycycline GFP control. e, Same as in a but with HepG?2 cells 
co-transduced with GFP or an HA-tagged Nse4 subunit of the Smc5/6 
complex together with GFP or GFP-HBx as indicated. Luciferase and 
western blot analyses performed 6 days later. In parallel, HA-Nse4 binding 
to the episomal reporter (filled bars) or the B-actin gene (open bars) was 
monitored by ChIP using anti-HA antibodies (bottom). Data expressed as 
percentage of input DNA recovered. Note the reduced binding of HA-Nse4 
in the presence of HBx despite comparable expression levels, consistent 
with the protein associating to DNA only when incorporated into the 
Smc5/6 complex. See also Extended Data Fig. 3c. *P < 0.05 by paired 
ANOVA relative to GFP control. Data in a, b, d and e are mean +s.e.m.; 
n=3 independent experiments. 


and on Smc5 and Smcé6 protein levels (Fig. 2b and Extended Data 
Fig. 2d). This implicates the DDB1 E3 ubiquitin ligase activity in HBx 
function. Consistently, HBx retained transactivation activity and the 
ability to decrease Smc5 and Smcé protein levels when fused to wild- 
type DDB1 but not to the mutant that cannot incorporate into the E3 
ligase (Fig. 2c and Extended Data Fig. 2e). 

The above results are consistent with HBx and WHx using the 
DDB1 E3 ubiquitin ligase to target the Smc5/6 complex for ubiquitin- 
mediated proteasome degradation. They further suggest that this event 
explains the stimulatory effect of HBx on extrachromosomal reporter 
gene activity. To test the latter hypothesis, we monitored the conse- 
quence of depleting Smc5 or Smc6 by RNA interference. HepG2 cell 
lines were generated to express doxycycline-inducible short hairpin (sh) 
RNA targeting Smc5, Smc6, or Cardif (MAVS) as a control. Dual Renilla 
and firefly luciferase reporter genes driven by unrelated promoters were 
tested in pairwise combinations in these cells, with one being inte- 
grated into the chromosomes and the other transiently transfected, and 
therefore episomal. All three reporter constructs tested responded to 
HBx only as episomal templates (Fig. 2d and Extended Data Fig. 2f, g). 
This was consistent with our earlier observation that HBx stimulates 
extrachromosomal DNA templates selectively and independently of 
promoter type®. Depletion of Smc5 or Smc6, which destabilizes the 
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Figure 3 | HBx promotes HBV transcription in PHHs by preventing 
Smc5/6 binding to the viral genome. a, Purified PHHs were left 
uninfected or infected with wild-type or HBx-deficient (AX) HBV. HBeAg 
secretion, a marker for HBV gene expression, and indicated proteins 

were quantified 10 days later by ELISA and western blot. HBcAg is 

the viral capsid. HBeAg concentrations are relative to wild-type HBV, 
which was set to 100. See also Extended Data Fig. 6a. b, Confocal 
microscopy of control (left) and HBV-infected (right) PHHs stained 

for HBcAg (red) and Smcé6 (green). Cell nuclei visualized with 
4',6-diamidino-2-phenylindole (DAPI, blue). HBcAg-positive and 
-negative cells are indicated by yellow and white arrows, respectively. 

See also Extended Data Fig. 7. c, PHHs were mock transduced (none) 

or transduced 1 day before HBV infection with indicated shRNAs or 
GFP-HBx. HBeAg secretion and viral RNA production were assessed 

8 days later by ELISA and northern blot. Mean + s.e.m.; n = 4 independent 
experiments with three different PHH donors. *P < 0.05 by paired 
ANOVA relative to matched no shRNA control (none). d, Same as in 

c but cells infected with HBx-mutant HBV were examined for surface 
antigen (HBsAg) positivity 12 days after infection. Cell nuclei stained with 
Hoechst dye (blue). e, PHHs were mock transfected (none) or transfected 
with indicated siRNAs 4 days before HBV infection. HBeAg secretion was 
measured 13 days later. Mean + s.e.m.; n = 4 independent experiments 
with two different PHH donors. *P < 0.05 by paired ANOVA relative to no 
siRNA control (none). f, PHHs were infected as indicated. At 10 days after 
infection, the binding of Smc5/6 to repressed CHTA and active -actin 
genes, and to episomal HBV DNA was monitored by ChIP using anti-Nse4 
antibodies. Mean +s.e.m.; n = 3 independent experiments with different 
PHH donors and virus stocks. **P < 0.01 by unpaired ANOVA relative to 
WT HBV 8-actin control. 
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Figure 4 | HBV infection induces Smcé6 degradation in humanized 
mouse liver tissue. a, Fresh-frozen liver tissue sections from the indicated 
humanized mice were stained for human Smcé6 (green) and HBsAg (red). 
Blue, DAPI. Insets represent a digital zoom of selected areas. Note the 
absence of Smcé6 nuclear staining in the HBV-infected tissue. See also 
Extended Data Figs 8 and 9. Images are representative of six HBV-infected 
and three uninfected humanized mice. b, Western blot analysis for human 
Smcé6 and HBsAg expression in the indicated humanized liver samples. 
Human cytokeratin-18 (CK-18) served as a control. 


entire Smc5/6 complex”', stimulated expression of the three episomal 
constructs to levels approaching those measured in the presence of 
HBx, but remained without effect on the corresponding chromosomal 
reporters (Fig. 2d and Extended Data Fig. 2f, g). This recapitulated 
exactly the situation observed with HBx. Moreover, knockdown of 
Smc5 or Smc6 had no further effect on transcription stimulated by 
HBx, consistent with activation by HBx and by Smc5/6 knockdown 
occurring through the same pathway. HBx-mediated destruction of 
the Smc5/6 complex, which has been involved in cell cycle progres- 
sion and recombinational DNA repair??*4, did not alter the cell cycle 
or induce integration of the reporter gene into the chromosomes 
(Extended Data Fig. 3a, b). Instead, chromatin immunoprecipitation 
(ChIP) analysis revealed that the Smc5/6 complex binds to transiently 
transfected reporter plasmids but not to a chromosomally integrated 
reporter, and that HBx reduces this binding (Fig. 2e and Extended Data 
Fig. 3c). This is in line with recent in vitro studies”’. Thus, HBx stimu- 
lates gene transcription mainly, if not exclusively, by triggering degra- 
dation of the Smc5/6 complex to prevent its association with episomal 
DNA templates. 

The importance of these findings for HBV replication was exam- 
ined using an infection assay of primary human hepatocytes (PHHs). 
In this assay, mutant HBV lacking a functional HBx gene is fully 
competent to enter cells and deliver its circular DNA genome into 
the nucleus. However, no viral gene transcription is detected4 
(Fig. 3c and Extended Data Fig. 4a). We observed that cells infected at 
high efficiency (Extended Data Fig. 5) with wild-type HBV displayed 
decreased levels of Smc5 and Smc6, but not when infected with an 
HBx-defective virus (Fig. 3a, b and Extended Data Figs 6a and 7). 
Conversely, depletion of Smc6 by shRNA (Extended Data Fig. 6b) 
restored replication competency to the HBx-deficient virus without 
affecting genome copy number (Extended Data Fig. 4b) but had no 
effect on replication of the wild-type virus (Fig. 3c, d and Extended 
Data Fig. 6c). The same was observed in HBV-permissive HepaRG 
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cells (Extended Data Fig. 6d). Furthermore, siRNA-mediated silenc- 
ing of DDB1 blocked HBV transcription, and this effect was relieved 
by the concomitant downregulation of Smc6 or Smc5 (Fig. 3e and 
Extended Data Fig. 6e-i). Thus, a major function of the HBx-DDB1 
E3 ligase interaction is to target the Smc5/6 complex for destruction. 
ChIP analysis demonstrated that the Smc5/6 complex also associates 
with the HBV episomal DNA genome, and that binding was reduced 
in the presence of HBx (Fig. 3f). 

In summary, our results indicate that the Smc5/6 complex binds 
episomal DNA templates and inhibits their transcription. HBx, by 
destroying the complex, relieves the inhibition to allow productive 
HBV gene expression. Further supporting this notion, HBV-infected 
humanized mouse liver tissues show markedly reduced levels of 
Smc6 (Fig. 4 and Extended Data Figs 8 and 9). This points to a novel 
role for the Smc5/6 complex as a host restriction factor, and offers a 
potential new avenue for therapeutic intervention against HBV infec- 
tion. The Smc5/6 complex is, together with condensin and cohesin, 
one of the three related Smc complexes identified in eukaryotes”®. 
These complexes share a similar architecture and play fundamental 
roles in chromosome organization, segregation, and repair (reviewed in 
ref. 27). The Smc5/6 complex has well-recognized functions in homol- 
ogous recombination-mediated DNA repair!*®. In addition, the com- 
plex has been implicated in DNA replication, chromosome topology, 
and transcription, but little is known of its precise function!®-!77+. 
A role in host defence against viral infection has not previously been 
reported. It will be interesting to determine whether the restrictive 
activity of the Smc5/6 complex is connected to any of its cellular func- 
tions, and whether the complex exhibits antiviral activity against other 
episomal DNA viruses. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Expression plasmids. HBx fused at its carboxy (C) terminus via a 10-amino-acid 
linker (HMRSRSGLET) either to wild-type DDB1 or to the CUL4-binding- 
defective DDB1 mutant has been described*”*. The SV5-V-DDBI1 fusions were 
generated by replacing the sequence encoding HBx within the HBx-DDB1 fusions 
by the full-length SV5-V coding region’. Unless fused to DDB1, all viral proteins 
contain an amino (N)-terminal GFP tag and have been described®. For TAP, the 
proteins carried an N-terminal Flag tag followed by two Strep-tag II sequences 
(FS-tag)”’. For ChIP analysis, the full-length NSE4a coding region was amplified 
by PCR from a Namalwa B cell complementary DNA (cDNA) library and fused 
N-terminal to three tandem copies of the HA epitope. Proteins were expressed from 
a stably integrated tetracycline-inducible pTRE2hyg plasmid vector (Clontech) 
in Fig. 1b and Extended Data Fig. 1a, b, from the episomal Epstein-Barr virus- 
based expression vector EBS-PL™ in Fig. 1c and Extended Data Fig. 1d, from the 
tetracycline-inducible lentiviral vector pINDUCER20 (ref. 31) (a gift from 
S. Elledge) in Fig. 2c and Extended Data Fig. 2e, and from the lentivirus vector 
pWPT°” in all the other figures. 

Luciferase reporter constructs. The luciferase reporter genes were constructed into 
pcDNA3 (pGL3 in the case of HBV Enhancer I°) for transient transfection and into 
the self-inactivating lentiviral vector pWPT (pWPXL in the case of HBV Enhancer 
1°) for chromosomal integration. In Extended Data Fig. 3c, the EFl« episomal 
reporter was delivered using an integrase-defective (D116A) lentiviral vector™. 
RNA interference. The short hairpin RNAs (shRNAs) used for Smc5 and Smc6 
knockdown were cloned into the miR30 context as 116-nucleotide XhoI-EcoRI 
fragments into pINDUCER1O0 (ref. 31) and/or pGIPZ (Open Biosystems) len- 
tiviral vectors. The sequences were as follows, with the hairpin sequence in 
capital letters and flanking miR30 sequences shaded in bold type: shSmc5(1), 
gagcgGTGAGGTGAAAGAAGTGTTTCTtagtgaagccacagatgtaAGAAACACTTC 
TTTCACCTCATtgcct; shSmc5(2), gagegGTGCGAAACTTGTTACCGAATT 
tagtgaagccacagatgtaAATTCGGTAACAAGTTTCGCATtgcect; shSmc6(1), 
gagcgGTGAGCAGCTTTGTAAACGAATtagtgaagccacagatgtaATTCGTTTACAA 
AGCTGCTCATtgcect; shSmc6(2) (from Thermo Scientific (V3LHS_325916), 
gagcg CAGAGACAGTGCTACTAATCAAtagtgaagccacagatgtalT TGATTAGTAG 
CACTGTCTCTAtgect. 

The constructs used were shSmc5(2) and shSmc6(1) in Fig. 2d, shSmc5(1) 

and shSmc6(2) in Extended Data Fig. 2f, and shSmc6(2) in Extended Data 
Fig. 2g. The Cardif shRNA construct in pINDUCER1O0 was a gift from D. Garcin. 
The GAPDH shRNA construct in pGIPZ was purchased from Thermo Scientific 
(RHS4371). Expression was from pINDUCER1O in Fig. 2d and Extended Data 
Fig. 2f, g and from pGIPZ in Fig. 3c, d. Details of plasmid constructions are avail- 
able upon request. The sequences of the siRNAs used in the other figures can be 
found in Supplementary Table 2. 
Cell culture and establishment of stable cell lines. The human hepatoma cell 
lines HepG2 (ATCC HB-8065), HepG2‘* (ref. 33), and derivatives were grown 
at 37°C under 5% CO) in modified Eagle's medium (MEM; Life Technologies or 
Sigma-Aldrich) supplemented with 10% (vol/vol) fetal calf serum (Gibco), 1% (v/v) 
penicillin/streptomycin solution, 2mM t-glutamine, 1 mM sodium pyruvate, and 
0.1mM MEM non-essential amino acids solution (all from Life Technologies). In 
Extended Data Fig. 6d, HepaRG cells were cultured, induced to differentiate, and 
infected with HBV as described*!. The cell lines were not authenticated or tested 
for Mycoplasma contamination because of their direct purchase from ATCC or 
low passage number. 

The stable HepG2 cell lines used in Fig. 1b and Extended Data Fig. 1a, b that 
expressed the tetracycline-inducible transactivator rtTA and various FS-tagged 
proteins from a tetracycline-responsive promoter were established as follows. The 
HepG2"*t" parental cell line™* was plated at a density of 1 x 10° cells per 6-cm dish 
and transfected using FUGENE HD (Roche) with the relevant constructs cloned 
into pTRE2hyg (Clontech), which carries a hygromycin resistance marker. At day 3 
after transfection, cells were selected in medium containing 250,.g ml! hygro- 
mycin B (PAA Laboratories) and 100,.gml~! G418 (Geneticin, Gibco). Resistant 
colonies were picked and expanded in 48-well plates under the same selection 
conditions. Individual clones were assayed by reverse transcription (RT)-PCR 
and western blot analyses for transgene expression in the presence or absence of 
2g ml! doxycycline (Sigma Aldrich). 

The HepG2 cells in Fig. 2c expressing the doxycycline-inducible HBx-DDB1 
fusions from pINDUCER20 lentiviral vector carrying a neomycin resistance 
marker were obtained by selection for 2 days with 1mg ml“! G418 (Promega) 
starting at day 3 after transduction. 

The HepG2 cells in Fig. 2d and Extended Data Fig. 2f, g expressing doxycycline- 
inducible shRNAs against Smc5, Smc6, or Cardif from pINDUCER10 lentiviral 
vector were obtained by selection with 101g ml~! puromycin (Calbiochem) for 
4 days starting at day 3 after transduction. The resulting puromycin-resistant 


cells were then cultured for 2 weeks in the absence or presence of 2.5 4g ml“! 
doxycycline before transduction of GFP or GFP-HBx to maximally induce shRNA 
expression. Fluorescence microscopy indicated that >90% of the cells were express- 
ing turboRFP (tRFP) from the tRFP-shRNA cassette*!. 

The HepG2.2.15 (ref. 4), HepAD38 (ref. 35), and HepG2-H1.3x- (ref. 4) stable 

cell lines used to produce HBV virions carrying either a wild-type genome or a 
genome bearing a defective HBx gene have been described. They were maintained 
in Dulbecco’s modified Eagle’s medium/F-12 (DMEM/F12) complemented with 
10% fetal calf serum, 1% non-essential amino acids, 50 1g ml! kanamycin, and 
200 1g ml“! geneticin. 
Transfection, transduction, and reporter gene assay. Transfection of plasmid 
DNA in HepGz2 cells was performed using X-tremeGENE HP DNA Transfection 
Reagent (Roche) following the manufacturer’s instructions. Transfection of 
siRNA duplexes in HepaRG cells was performed using DharmaFECT-1 (Thermo 
Scientific) transfection reagent and in PHHs using Lipofectamine RNAiMax (Life 
Technologies). Recombinant lentivirus production and transduction of HepG2 
cells were performed as described previously”°. For PHH transduction, lentivirus 
supernatants were concentrated by ultracentrifugation at 130,000 g for lh ina 
SW32 Beckman rotor at 4°C and transduction was performed overnight in the 
presence of 10ngml’ of EGF*. For luciferase reporter gene assay and western 
blot analysis, cells were typically seeded at a density of about 6 x 10° cells per 
30 mm diameter well (1 x 10° cells per square centimetre) and transfected the 
next day with 30 ng of reporter plasmid DNA and 2 1g of empty EBS-PL vector. 
For the ChIP experiment in Fig. 2e, 3 or 30 ng of reporter plasmid were used 
with similar results. 

In Fig. 2d and Extended Data Fig. 2f, g, cells that had been cultured in the 
absence or presence of doxycyclin for 2 weeks to induce shRNA expression from 
pINDUCER1O0 were first transduced and the next day transfected with, respectively, 
lentiviral and plasmid constructs carrying the Renilla and firefly luciferase reporter 
genes as indicated in the figures. Cells were trypsinized the following day, washed 
with phosphate-buffered saline (PBS), replated in 6- or 24-well dishes at a density 
of about 6 x 10* cells per square centimetre and transduced 6h after replating 
with lentiviral vectors encoding GFP or the GFP-tagged viral proteins. In Fig. 2b, 
dimethylsulphoxide (DMSO) or 51M MLN4924 (Active Biochem) was added 6h 
before transduction and culture medium was replaced daily with fresh medium 
containing (or not) MLN4924 for 3 days. In Fig. 2c, d and Extended Data Fig. 2f, g, 
cells were constantly grown in the absence or presence of 2,1gml~! doxycycline. 
Cell lysates were prepared 6 days later (3 days in Fig. 2b) for luciferase assay and 
western blot analysis. Luciferase activities were measured using the Luciferase 
Assay System (Promega) or Dual-Luciferase Reporter Assay System (Promega) 
according to the manufacturer's instructions. The activities were normalized to 
protein concentrations as measured by the Bradford assay (Bio-Rad). 
Purification of Flag-StrepII-tagged proteins. The two-step purification scheme 
was adapted from ref. 29. The HepG2"*™ cell lines conditionally expressing 
FS-tagged proteins from a doxycycline-inducible promoter were seeded on thirty 
15-cm dishes and allowed to reach 90% confluence. Expression of the FS-tagged 
proteins was then induced by addition of doxycycline (21g m1~') for 48h. Cells 
were rinsed once with 20 ml PBS (per plate) and scraped off after incubation on ice 
for 20 min in 1.5 ml lysis buffer (50 mM HEPES/KOH, pH7.5, 150mM NaCl, 5mM 
KCl, 5mM MgCh, 501M ZnCh, 0.1% IGEPAL and protease inhibitor cocktail 
from Sigma). The lysates were pooled, supplemented with glycerol (10% final), 
and clarified by centrifugation at 20,000g for 20 min at 4°C in a Sorvall HB-6 rotor. 
Supernatants were collected, passed through a 0.45 1m Millipore PVDF filter, and 
mixed in a 50 ml Falcon tube with a 100 11 packed bead volume of Strep-Tactin 
Sepharose (IBA) washed with lysis buffer. After incubation for 2h at 4°C under 
constant rotation, the suspension was poured into a 10 ml disposable column (Bio- 
Rad). After sedimentation, the resin was washed twice with 10 ml lysis buffer con- 
taining 10% glycerol. The bound proteins were released by incubating the resin 
with 10 ml of lysis buffer supplemented with 10% glycerol and 2.5mM desthio- 
biotin (IBA 2x Tactin Elution Buffer) for 10 min on ice. The eluted material was 
directly mixed with a 100 :l packed volume of washed anti-Flag Sepharose beads 
(Sigma) and incubated with rotation for 1h at 4°C in a sealed 10 ml disposable 
column (Bio-Rad). Subsequently, the column was drained by gravity and the resin 
washed five times with 1 ml lysis buffer adjusted to 10% glycerol. Bound proteins 
were recovered by adding five times sequentially 200 1] lysis buffer supplemented 
with 200 1g ml“! Flag peptide (Sigma) and gently mixing for 5 min on ice. The 
eluted material from three such purifications was pooled and precipitated with 
acetone (80% final concentration). After centrifugation at 4°C in an Eppendorf 
microcentrifuge, the protein pellet was air-dried, resuspended in 1x Laemmli 
sample buffer, and run into a 4% SDS-PAGE stacking gel. The protein band was 
excised and processed for mass spectrometry analysis using an LTQ-Orbitrap Velos 
mass spectrometer (Thermo Scientific). Listed in Fig. 1b are proteins identified 
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with 99% certainty and represented by at least two peptides ascertained at a 95% 
confidence level. 

In Fig. 1c, extracts from 6 x 10° HepGz2 cells seeded on a 10-cm culture dish 
were prepared 2 days after transfection with 21g of EBS-PL expression plasmids. 
The experiment in Extended Data Fig. 1d was performed in the same way except 
that the transfected cells were selected in medium containing 200 1g ml! hygro- 
mycin B (PAA Laboratories) and extracts were prepared from a 15-cm dish of 
confluent cells. Proteins were purified by incubation with 25 11 packed bead volume 
of Strep-Tactin Sepharose (IBA). 

Cell cycle analysis. The cell cycle analysis in Extended Data Fig. 3a was per- 
formed starting with 1.5 x 10° HepG2 cells transduced with GFP or GFP-HBx 
and cultured as above. Cells were harvested, washed once with PBS, and recov- 
ered by centrifugation at 500g for 5 min. Cells were fixed in 1 ml ice-cold 70% 
ethanol added dropwise while vortexing and incubated for 30 min on ice. The 
fixed cells were pelleted by centrifugation at 800g for 15 min, washed twice with 
PBS and directly resuspended in 5011 PBS supplemented with 100g ml! RNase 
A (Roche). Cellular DNA was stained by addition of 60011 of propidium iodide 
(Sigma Aldrich; 50j1g ml“! in PBS) and incubation for 10 min at room tempera- 
ture (20-25 °C) and overnight at 4°C. Samples were analysed using a BD Accuri 
C6 flow cytometer. 

Western blotting. With the exception of Fig. 4b and Extended Data Fig. 6g (see 
below), western blot analysis was performed as described** except that cells were 
disrupted in 2% SDS in water and protein concentration was estimated using 
the BCA Protein Assay (Novagen). The membranes were probed with 1:5,000 
mouse anti-Flag antibodies (Sigma A2220) to detect the FS-tagged HBx-DDB1 
and SV5-V-DDB1 fusion proteins, 1:5,000 mouse monoclonal anti-GFP (Roche 
11814460001) to detect GFP-HBx in Extended Data Fig. 2a, b, 1:5,000 mouse 
monoclonal anti-HA (clone 16B12, Covance) to detect HA-Nseé4 in Fig. 2e, 1:2,000 
rabbit polyclonal antibodies against Smc5 or Smcé6 (ref. 39) (a gift from A. R. 
Lehmann), 1:1,000 rabbit polyclonal anti-Nse4 (Abgent AP9909A), 1:2,000 rabbit 
polyclonal anti-Cardif/MAVS (Enzo Life Sciences ALX-210-929), 1:500 mouse 
monoclonal anti-STAT1 (BD Transduction Laboratories G16920), 1:500 goat 
polyclonal anti-DDB1 (Everest Biotech), 1:500 rabbit polyclonal anti-HBc (a gift 
from M. A. Petit), 1:500 mouse monoclonal anti-ubiquitin (Santa Cruz SC-8017), 
1:10,000 mouse monoclonal anti-c-tubulin (Sigma-Aldrich T5168), or 1:10,000 
mouse monoclonal anti-GAPDH (Sigma-Aldrich G8795) antibodies. Horseradish- 
peroxidase-conjugated sheep anti-rabbit or anti-mouse, or bovine anti-goat IgG 
(Amersham Biosciences, 1:5,000), were used as secondary antibodies, and detec- 
tion was performed with ECL (Pierce). 

In Fig. 4b, flash-frozen liver tissue from uPA-SCID mice re-populated with 
human hepatocytes was homogenized in RIPA buffer containing a broad spec- 
trum protease inhibitor cocktail (Fisher Scientific PI-78430) using a Qiagen Tissue 
Lyser for mechanical tissue disruption. The resulting homogenates were clarified 
by centrifugation for 10 min at 4°C in an Eppendorf 5415D microcentrifuge 
equipped with a fixed-angle rotor (F45-24-11) at 15,996g. Protein concentration 
was determined using the Bradford Protein assay (Bio-Rad). The membranes were 
probed with 1:1,000 mouse monoclonal anti-human SMC6 (Abgent AT3956A), 
1:1,000 mouse anti- HBsAg (International ImmunoDiagnostics 1113), and 1:1,000 
monoclonal mouse anti-human CK-18 (Dako M701029-2). IRDye 680RD goat 
anti-rabbit or IRDye 800CW goat anti-mouse IgG (Licor, 1:5,000) were used as 
secondary antibodies. Blots were visualized using an Odyssey Infrared Imaging 
System (Licor). The species specificity of the anti-human SMC6 and CK-18 anti- 
bodies was confirmed by western blot detection of Smc6 and CK-18 in HepG2 
cells and PHHs but not in murine hepatocyte AML12 cells (ATCC CRL-2254, 
data not shown). 

In Extended Data Fig. 6g, PHHs were lysed in RIPA buffer supplemented with 
1x protease inhibitor cocktail (ThermoScientific), scraped from the plate, and 
sonicated for 10s to break up cellular debris. Protein concentration was estimated 
using a BCA Protein Assay (Novagen). The membranes were probed with 1:1,000 
mouse monoclonal anti-Smc5 (Bethyl Laboratories A300-236A), 1:1,000 mouse 
monoclonal anti-human SMC6 (Abgent AT3956A), 1:1,000 rabbit polyclonal anti- 
DDBI1 (Cell Signaling 5428), and 1:1,000 mouse monoclonal anti-GAPDH (Novus 
Biologicals G8795). Secondary antibodies and detection were as in Fig. 4b. 
RT-PCR analysis. In Extended Data Fig. 2c, total cellular RNA was extracted from 
HepG2? cells using TRIzol reagent (Invitrogen Life Technologies) following the 
manufacturer's instructions. RNA was treated with RNase-free DNase (Promega) 
in the presence of 1 Ul! RNAsin (Promega). After phenol/chloroform 
extraction and ethanol precipitation, 1 jug of RNA was reverse transcribed using 
MLV reverse transcriptase (Promega) and an oligo(dT)15 primer. Quantification 
by real-time PCR was performed as described®. The PCR values were normal- 
ized against those obtained for the TBP gene to correct for variation between 
samples. 
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In Extended Data Fig. 6e, f, i, total cellular RNA was isolated from PHHs cul- 
tured in 96-well plates using an RNeasy 96 Kit (Qiagen) following the manufac- 
turer’s instructions. Real-time RT-PCR was performed using a QuantStudio 7 Flex 
Real-Time PCR System (Invitrogen Life Technologies) following the manufacturer's 
instructions. The PCR values were normalized against those obtained for the 
6-actin gene to correct for variation between samples. All oligonucleotide primer 
sets were manufactured by Life Technologies. 

Northern blot analysis. Northern blot analysis in Fig. 3c and Extended Data Fig. 6c 
was performed on total RNA isolated using TRIzol Reagent (Gibco-Invitrogen) 
and treated with RNase-Free DNase I (Ambion, Life Technologies). The RNA 
(10g per sample) was denatured by glyoxal treatment and separated on a 1% 
agarose gel using a NorthernMax-Gly kit (Ambion, Life Technologies). After 
capillary transfer to Hybond Nt membranes (Amersham), the RNA was fixed 
on the membrane by ultraviolet cross-linking and hybridized with a **P-labelled 
full-length HBV genomic DNA or 28S ribosomal RNA oligonuclotide probes. 
HBV virion production. The HBV producing HepG? cell lines* were grown to 
confluence in DMEM/F12 as described above. When confluence was reached, 
cells were maintained in a 1:1 mixture of Dulbecco's modified Eagle's medium 
(DMEM) and Williams’ E medium (Invitrogen Life Technologies) supplemented 
with 5% fetal calf serum, 7 x 10°° M hydrocortisone hemisuccinate, 5 1g ml"! 
insulin, and 1% DMSO. HBV-containing supernatants were collected every 
2 days for 10 days. The supernatants were pooled, filtered through a 0.22 1m 
filtration unit (Merck Millipore), and concentrated by overnight precipitation 
with PEG 8000 (5% final) and centrifugation at 4°C for 1h at 6,000g. The viral 
pellet was resuspended in 1/50 of the original culture volume in PBS and sedi- 
mented at 4°C through a 20 ml 10-20% sucrose gradient in PBS at 130,000g for 
16h in a SW32 Beckman rotor. The final pellet was resuspended in 1/100 of the 
original volume in William’s E medium supplemented with 2% DMSO and 0.1% 
SVF and stored in aliquots at — 80°C. Infectious virus titre was estimated by real- 
time PCR quantification of the viral DNA recovered by immunoprecipitation 
with an excess of mouse monoclonal antibodies against the large envelope protein 
(preS1)*°. Real-time PCR was performed using the SYBR Green PCR Master Mix 
(Roche Applied Science) and a LightCycler 480 system (Roche Applied Science) 
as described™. 

HBV virion production from HepAD38 cells was as follows. Cells were 
grown to confluence in DMEM/F12 supplemented with 10% fetal bovine serum 
(HyClone, Thermo Scientific), 1% non-essential amino acids, 501g ml~! kana- 
mycin, 200j1g ml“! geneticin (all from Gibco Life Technologies), and 0.3 pg ml! 
tetracycline (Sigma-Aldrich). Once confluence was reached, the medium was 
exchanged for identical medium as above, but lacking tetracycline. After 10 days, 
the virus-containing medium was collected every 3-4 days for 21 days and stored 
at —80°C. Subsequently, the aliquots were thawed, precipitated overnight at 4°C 
using 6% PEG-8000 (Promega), and centrifuged at 4°C for 15 min at 1,500g. Viral 
DNA was isolated using a DNeasy Blood & Tissue Kit (Qiagen). Viral titre was 
determined by real-time PCR. 

PHH isolation and HBV infection. PHHs were isolated from resected normal 
human liver tissue using a two-step perfusion method and cultured as described". 
PHHs were infected with PCR-normalized HBV virus stocks at a multiplicity of 
infection of 200-400 viral genome equivalents per cell**. The experiments in 
Fig. 3b, e and Extended Data Fig. 6e-i were performed using PHHs purchased from 
Life Technologies and maintained in William's E medium with added supplements 
as specified by the vendor. Cells were infected 24h after plating on collagen-coated 
96-well plates (BD Biosciences) with HepAD38-derived HBV virions at 500 viral 
genome equivalents per cell. In Extended Data Fig. 6g, cells were cultured on 
collagen-coated six-well plates (BD Biosciences). 

Human liver chimaeric uPA-SCID mice. All animal work was performed by 
Phoenix Bio, in accordance with the Guide for the Care and Use of Laboratory 
Animals and approved by the Animal Ethics Committee of Phoenix Bio. Nine male 
uPA-SCID mice were transplanted at 16-23 days old with 1 x 10° PHHs froma sin- 
gle healthy donor as described*. After 10-13 weeks, six of these mice were infected 
by intraperitoneal injection with 5 x 10° genome equivalents of cell-culture-derived 
HBV genotype C. These animals were killed and serum and liver specimens were 
collected for measurement of HBV infection 14 weeks later. Serum HBV DNA 
reached a titre of >1.5 x 10’ copies per millilitre and serum HBsAg levels were 
>3.2 x 10? IU ml! in all infected animals. As a control, three of the nine mice 
were left uninfected. These animals were killed at 15 weeks after transplantation 
and then processed identically to the HBV-infected animals. 

ELISA. The amounts of HBe and HBs antigens present in control and HBV- 
infected PHH culture media were determined at the indicated times by ELISA 
using Monolisa HBe Ag-Ab PLUS (Bio-Rad) and Monolisa anti-HBs PLUS (Bio- 
Rad) kits and a Chemiluminescence Immunoassay (Autobio Diagnostics) kit. In 
Fig. 3e and Extended Data Fig. 6h, ELISA was performed using an HBeAg EIA 
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(International Immunodiagnostics) kit and a SpectraMax M5 reader (Molecular 
Devices). Purified HBeAg was used as standard. 

Immunofluorescence and confocal analysis. The proportion of PHHs infected 
with HBV in Fig. 3d and Extended Data Fig. 5 was estimated by confocal immu- 
nofluorescence microscopy for HBsAg expression™’. PHHs were cultured and 
infected on collagen-I-coated glass slides. At 9-12 days after infection, cells were 
fixed with 3% paraformaldehyde in PBS and permeabilized with 0.1% saponin in 
PBS for 30 min at room temperature. Cells were stained with mouse monoclo- 
nal anti-HBs antibodies (a gift from M. A. Petit) diluted 1:200 in PBS contain- 
ing 0.1% saponin and 3% bovine serum albumin (BSA). After washing in the 
same buffer, bound antibodies were detected using Alexa-Fluor-647-conjugated 
goat anti-mouse secondary antibodies (Invitrogen). After additional washes, the 
slides were mounted with fluorescence mounting medium (ProLong Gold, Life 
Technologies) and observed under a Leica SP5 X confocal microscope. Image 
analysis used ImageJ software. 

Confocal analysis of Smcé6 expression in HBV-infected and uninfected PHHs in 
Fig. 3b and Extended Data Fig. 7 was performed as follows. PHHs were seeded onto 
rat tail collagen-I-coated glass coverslips (Corning BioCoat 354089) at a density 
of 1.2 x 10° cells per well of a 6-well dish and allowed to adhere overnight. Cells 
were then mock-infected or infected at 500 viral genome equivalents per cell with 
wild-type HBV derived from HepAD38 cells. On day 13 after infection, the cells 
were washed twice with PBS and fixed in freshly prepared 4% paraformaldehyde 
(Electron Microscopy Services RT-15710) in PBS for 10 min at room temperature. 
After three washes in PBS, the cells were permeabilized in 1% TX-100 in PBS 
for 10 min at room temperature. Coverslips were incubated overnight at 4°C in 
3% normal goat serum (Jackson ImmunoResearch Laboratories 005-000-121) 
diluted in PBS to quench non-specific background. Coverslips were then inverted 
onto 100,11 droplets containing mouse anti-human SMC6 (Abgent AT3956a, 1:500) 
and polyclonal rabbit anti- HBcAg (Dako B0586, 1:1,600) diluted in 3% normal goat 
serum, and incubated at room temperature for 1h. As a control, cells were reacted 
in parallel with purified mouse IgG2a isotype-matched control (Life Technologies 
MG2a00). After eight washes in PBS, coverslips were inverted onto 10011 drop- 
lets of highly cross-adsorbed Alexa Fluor 488 goat-anti-mouse (Molecular Probes 
A11029) or Alexa Fluor 594 goat-anti-rabbit (Molecular Probes A11037) secondary 
antibodies diluted 1:200 in 3% normal goat serum, and incubated for 1h at room 
temperature. Coverslips were washed eight times in PBS and rinsed in double- 
distilled water before mounting onto glass slides using Vectashield DAPI- 
containing hardening mounting medium (Vector Laboratories H-1500). Imaging 
was performed using an upright Zeiss LSM780 confocal system equipped with a 
x63 objective lens (NA 1.4) and Zen software. All images within each sample set 
were captured using identical confocal settings. Images were adjusted for brightness 
and contrast using Adobe Photoshop CS6. 

For immunofluorescence analysis of humanized mouse liver tissues, flash- 
frozen liver tissues was cryosectioned at a thickness of 51m onto microscope slides, 
fixed for 10 min in ice-cold 4% paraformaldehyde in PBS, and then washed three 
times in PBS. After one wash in Tween 20-containing AutoWash (BioCare Medical 
TWB945M), tissue sections were incubated with Rodent Block M solution (BioCare 
Medical RBM961L) for 30 min at room temperature, then stained overnight at 
4°C in a humidified chamber with mouse monoclonal anti-human Smcé6 (Abgent 
AT3956a, 1:200) and rabbit polyclonal anti- HBsAg (Virostat 1811, 1:500) antibodies 
diluted in a 1:1 solution of Rodent Block M (BioCare Medical RBM961) and 
Renoir Red diluent (BioCare Medical PD904M). In parallel, re-population of liver 
tissue with human hepatocytes was evaluated by staining representative sections 
from each animal with goat anti-human albumin polyclonal antibodies (Bethyl 
A80-129A, 1:200), whose species specificity has been demonstrated”, and for 
selected animals by co-staining with mouse monoclonal anti-human cytokeratin- 
18 antibody (Dako M7010, 1:25). Mouse IgG1 (Dako X0931) diluted in Dako 
Antibody Diluent S0809 served as a negative control (see Extended Data Fig. 9). 
After three 5 min washes in AutoWash buffer, tissue sections were reacted for 1h 
at room temperature with Alexa Fluor 488 donkey anti-mouse and Alexa Fluor 
594 donkey anti-rabbit secondary antibodies diluted 1:1,000 in PBS. Coverslips 
were applied using Vectashield DAPI-containing embedding medium (Vector 
Laboratories H-1500). Images were acquired using a x40 objective lens (Fig. 4a) 
or a X20 objective lens (Extended Data Fig. 8) and an inverted epifluorescence 
microscope (Leica DM LB) and arranged using Adobe Photoshop CS6. Note that 
imaging in Fig. 4a and Extended Data Fig. 8a was limited to areas completely 
re-populated with human hepatocytes, which were readily distinguished from 
poorly engrafted areas of mouse liver tissue on the basis of cellular morphology 
Extended Data Fig. 8b. 

ChIP and quantitative PCR. The ChIP analysis in Fig. 2e was performed using 
chromatin extracted from about 7 x 10° HepG2 cells cultured in 100 mm diameter 
wells as above and expressing HA-Nse4 from a lentiviral vector. In Fig. 3f, freshly 


prepared PHHs were seeded into 75 cm? collagen-coated tissue culture flasks (107 
cells per flask). Two days later, cells were infected with wild-type or HBx-mutant 
HBV particles. After culture for the indicated periods, cells were fixed with 1% 
formaldehyde (Sigma Aldrich 47608) for 10 min at 37°C before quenching with 
330 mM glycine. Cells were rinsed twice with ice cold PBS containing EDTA-free 
protease inhibitors (Roche) and 5mM Na-butyrate, scraped off in 1.5 ml of the 
same buffer and collected by centrifugation. Cells were resuspended and lysed for 
10 min at 4°C in 1 ml Nuclear Extraction buffer (10 mM Tris-HCl (pH 8.0), 10 mM 
NaCl, 1% NP-40) supplemented with protease inhibitor cocktail (Roche). The 
nuclei were recovered by centrifugation at 500g for 5 min at 4°C in an Eppendorf 
fixed-angle rotor (FA-45-18-11) and washed once in the same buffer. Nuclei were 
resuspended in 10011 FA-lysis buffer (50 mM HEPES/KOH, pH 7.5, 140 mM 
NaCl, 1mM EDTA, 1% Triton X-100) containing 1% SDS and incubated for 
10 min at room temperature. After addition of another 10011 FA-lysis buffer, the 
mixture was transferred to 1.5 ml bioruptor microtubes (Diagenode), and soni- 
cated using a Diagenode Bioruptor Pico water bath sonicator (ten cycles of 15s on 
and 30s off at high setting). The sonicated lysates were clarified by centrifugation 
at 16,000g for 10 min and mixed in 800 1l FA-lysis buffer supplemented with 
protease inhibitors with 40 1l packed bead volume of protein A-Sepharose CL-4B 
(GE Healthcare) coupled to 4,11 anti-HA antibodies (clone 16B12, Covance) or 
611 anti-Nse4 antibodies (Abgent AP9909A) and pre-incubated in FA-lysis buffer 
with sonicated salmon sperm DNA and BSA. After an overnight incubation at 
4°C under constant rotation, the beads were sedimented by brief centrifugation 
and the supernatant either discarded or in Extended Data Fig. 3c directly mixed 
and incubated overnight at 4°C with anti-Nse4 affinity beads as above. The beads 
were washed twice with 1 ml FA-lysis buffer, twice with FA-lysis buffer adjusted 
to 500 mM NaCl and 0.5% sodium deoxycholate, and twice with 10 mM Tris- 
HCl buffer (pH 8.0) containing 1mM EDTA, 250 mM LiCl, 1% NP-40, and 1% 
sodium deoxycholate. Bound protein-DNA complexes were released from the 
HA- and Nse4-affinity resins by incubation for 10 min at 65°C in 200,11 buffer 
containing 100 mM Tris-HCl (pH 8.0), 1% SDS, 10mM EDTA, and 10mM EGTA. 
After addition of 200 11 Tris-EDTA (pH 8.0) and 60\.g proteinase K (Bioworld 
Technology), DNA crosslinks were reversed by overnight incubation at 65°C. 
Samples were extracted twice with phenol-chloroform, once with chloroform, 
and then ethanol precipitated and resuspended in Tris-EDTA buffer. The input 
DNA treated identically and the recovered DNA were quantified in two separate 
real-time PCR runs using the KAPA SYBR FAST qPCR Kit Master Mix (2X) 
Universal (Kapa Biosystems) and the Bio-Rad CFX96 Real-time PCR System. The 
values were calculated as the ratios between the ChIP signals and the respective 
input DNA signals. In Fig. 3e, primers specific for the covalently closed circular 
HBV DNA (cccDNA)"* were used. Sequences of the oligonucleotide primers are 
given in Supplementary Table 2. 

Statistical analysis. No statistical methods were used to predetermine sample size. 
Statistical significance was tested using a one-tailed, paired t-test (for two sample 
comparisons) or one-way ANOVA with Dunnett's multiple comparison correction 
of log-transfomed data (for multiple comparisons). A value of P< 0.05 was consid- 
ered significant. The experiments were not randomized, and the investigators were 
not blinded to allocation during experiments and outcome assessment. 

Ethics statement. PHHs were isolated from liver specimens resected from patients 
undergoing partial hepatectomy (provided by M. Rivoire). Approval from the 
local and national ethics committees (French Ministry of Research and Education 
numbers AC-2013-1871 and DC-2013-1870) and informed consent from patients 
were obtained. 
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Extended Data Figure 1 | A strategy to identify E3 ubiquitin ligase 
substrates. a, Stable HepG2 cell lines were generated expressing GFP or 
the indicated GFP-tagged viral proteins carrying an N-terminal tandem 
Flag/StrepII (FS) tag from a doxycycline-inducible promoter (ref. 29 

and see Methods). After doxycycline induction, whole-cell extracts were 
prepared and the FS-tagged and associated proteins were purified by TAP. 
Eluted proteins were separated by gel electrophoresis and stained with 
Coomassie blue. CUL4 was identified by mass spectrometry. No protein 
candidate was identified showing a profile predicted for an HBx substrate: 
that is, co-purifying specifically with HBx and WHx but not with SV5-V, 
or co-purifying only with the DDB1-binding-defective HBx(R96E) mutant 
because of its otherwise unstable interaction with HBx and WHx when 
recruited to the E3 ligase. In particular, the Smc5/6 complex was not 
detected in these purifications. b, Purification of SV5-V and associated 
proteins. FS-tagged SV5-V was purified from a stably expressing HepG2 
cell line by TAP as above. Proteins were eluted under native conditions 
and processed for analysis by peptide fragmentation sequencing 
(nano-LC-ESI MS/MS). The number of unique peptides for each protein 
and the percentage of total protein sequence covered by these peptides 
are indicated. Listed are proteins identified with 100% certainty and 
represented by at least two peptides identified at a 95% confidence level. 
CUL4A and CUL4B are closely related paralogs. Note that no peptides 
were detected, even at low confidence level, for STAT 1 that is well known 
to be recruited by SV5-V to the DDB1 E3 ligase for ubiquitin-mediated 
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degradation. By contrast MDAS, a cytoplasmic sensor of viral RNA to 
which SV5-V also binds but, in contrast to STAT 1, without causing its 
degradation“, was present in the SV5-V purifications. This suggests that, 
in contrast to MDAS, STAT 1 associates with SV5-V very transiently, 
presumably because of its rapid degradation or dissociation from the E3 
ligase complex. We considered that the same may be true for the HBx 
target, thus precluding its identification by regular affinity purification. 

c, As a proof of principle for the use of a fusion strategy to identify the 
HBx target, we covalently linked SV5-V to wild-type DDB1 or to a CUL4- 
binding-defective DDB1 mutant that could not incorporate in the E3 ligase 
complex. See text for details. d, HepG2 cells were mock transfected (—) or 
transfected with plasmid DNA expressing the indicated FS-tagged SV5- 
V-DDB1 fusions. Whole-cell extracts were prepared and the fusion proteins 
purified by a single round of affinity purification. The amounts of fusion 
proteins recovered (IP) and the presence of STAT] in the eluates (co-IP) 
were assessed by western blotting. The fusion proteins were revealed using 
anti-Flag antibodies and were expressed at too low levels for detection in 
the crude extracts (input). Note that high amounts of STAT] are recovered 
only with the mutant SV5-V-DDBI fusion. e, Architecture of the Smc5/6 
complex. The core of the Smc5/6 complex is formed by a heterodimer of 
Smc5 and Smc6. The two proteins form a V-shaped structure and associate 
with four non-SMC proteins, designated Nsel—Nse4 (refs 15, 17). Note 
that depletion of any of the Smc5/6 complex subunit, other than Nse2, 
results in destabilization and degradation of the entire complex”'. 
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Extended Data Figure 2 | See next page for caption. 
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Extended Data Figure 2 | HBx induces rapid depletion of the Smc5/6 
complex by an MG132 proteasome inhibitor-sensitive pathway to 
stimulate extrachromosomal reporter gene activity. a, Same experiment 
as in Fig. 2a except that luciferase activity and Smc6 protein levels were 
monitored at the indicated time points after transduction of GFP or 
GFP-HBx. Luciferase activities are expressed in arbitrary units, with the 
fold increase in HBx-expressing cells relative to GFP control cells indicated 
on top. GFP-HBx was detected using anti-GFP antibodies. One out of two 
independent experiments. b, HepG2 cells were transduced with lentiviral 
vectors encoding GFP or GFP-HBx. Then, 16h later, DMSO (control) 

or 10,4M MG132 was added to the culture to inhibit proteasome activity. 
Cells were harvested 8 h later and the level of the indicated proteins and 
of global ubiquitination was analysed by western blotting. GFP-HBx 

was detected using anti-GFP antibodies. One out of two independent 


experiments. c, The effect of the indicated viral proteins on Smc5, Smc6, 
and luciferase mRNA levels was determined by real-time RT-PCR. The 
values are relative to those measured in GFP-transduced cells, which were 
set to 1. d, Similar experiment as in Fig. 2b except that two concentrations 
of MLN4924 were used. e, Similar experiment as in Fig. 2c. f, Similar 
experiment as in Fig. 2d but with HepG2 cells containing the HBV 
Enhancer-I-driven firefly luciferase reporter gene (HBV-FLuc) integrated 
into the chromosome and an episomal Renilla luciferase construct 

driven by the CMV promoter (CMV-RLuc). Note that the shRNAs against 
Smc5 and Smc6 used in this experiment differ from those in Fig. 2d 

(see Methods). g, Same with the Renilla luciferase CMV promoter 
construct chromosomally integrated and a firefly luciferase reporter gene 
driven by the EFla promoter episomal. Data in c, f and g represent the 
mean + s.e.m. of three independent experiments. 
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Extended Data Figure 3 | Smc5/6 degradation by HBx does not alter 
the cell cycle or promote chromosomal integration of the reporter 

but prevents the binding of Smc5/6 to the episomal DNA template. 

a, HepG2 cells were transfected with the HBV Enh1 luciferase reporter 
plasmid, split, and then either mock-transduced or transduced in 
triplicates with lentiviral vectors encoding GFP or GFP-HBx. Luciferase 
activity and protein expression (left panel) were monitored as in Fig. 2 

at 5 days after transfection. In parallel, cells were analysed by FACS for 
GFP positivity (middle inset) and for DNA content (right panels) after 
propidium iodide staining. b, HBx does not promote stable integration of 
the reporter gene into chromosomes. HepG2 cells were transduced with a 
lentiviral vector to chromosomally integrate a Renilla luciferase reporter 
gene and subsequently transfected with a firefly reporter construct. Cells 
were then split equally and transduced with lentiviral vectors encoding 
GFP or GFP-HBx. Activity of the episomal firefly (left) and integrated 
Renilla (right) luciferase genes was monitored at the indicated time 
points after transduction of GFP or GFP-HBx. The values are expressed 
in arbitrary units relative to those measured at day 1 in the GFP control, 
which was set to 1.0. Indicated above the columns in the left panel is 

the remaining luciferase activity (expressed in per cent) relative to that 


measured 3 days after transfection. Note that in the absence of HBx 

(black bars in the left panel) the episomal firefly luciferase signal decreases 
rapidly with time, presumably because of loss of the reporter plasmid. 

In the presence of HBx (red bars in the left panel), the signal slightly 
increases to reach sixfold higher levels relative to the GFP control at 3 days 
after transfection, and then drops with kinetics close to that observed in 
the GFP control. In contrast, expression of the integrated Renilla reporter 
remains constant (right panel). This argues against HBx stimulating 
episomal reporter activity by promoting its integration into chromosomes. 
c, Left panels, the same ChIP experiment using anti-HA antibodies as in 
Fig. 2e but with HepG2 cells containing the HBV Enhancer-I-driven firefly 
luciferase reporter (HBV Enh1-FLuc) integrated into the chromosome and 
an episomal EF 1a Renilla luciferase construct (EFla-RLuc) as a control. 
Data represent the mean +s.e.m. of three independent experiments. In one 
experiment, the unbound material from the indicated samples expressing 
no HA-tagged Nse4 was further purified using anti-Nse4 antibodies (lower 
right). Data are expressed as the percentage of input DNA recovered by 
ChIP. Note that in this experiment, the episomal reporter was delivered 
using an integrase-defective lentiviral vector™. 
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Extended Data Figure 4 | HBx or Smc5/6 knockdown does not affect 
HBV genome copy number. a, PHHs were infected with normalized 


stocks of wild-type (WT) or HBx-deficient (AX) HBV as in Fig. 3. HBeAg 


secretion (top) was assessed 10 days later by ELISA. In parallel, the 


amounts of cytoplasmic DNA replicative intermediates produced during 


reverse transcription of the viral pregenomic RNA (middle) and of the 


nuclear episomal HBV template extracted using a modified Hirt procedure 


(bottom) were analysed by Southern blot as reported**. b, PHHs were 
either mock transduced (none) or transduced with a lentiviral construct 


expressing an shRNA specific for Smcé6 and the next day infected with 
HBx-deficient HBV particles (HBV(AX) as in Fig. 3c. HBeAg secretion 
was assessed 12 days later by ELISA (top). In parallel, the amount of 
nuclear HBV genome was quantified by real-time FRET-PCR™ both 
directly or after treatment with Plasmid-safe DNase (Epicentre), an 
exonuclease that degrades single-stranded and double-stranded linear 
DNA but not the episomal HBV DNA (bottom left). Shown as a control 
for Plasmid-safe DNase treatment are the results for the chromosomal 
68-globin gene (bottom right). 
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Extended Data Figure 5 | Efficiency of PHH infection by HBV. The noise signals measured with the supernatants from the uninfected cells 
a, Purified PHHs were either left uninfected (0) or infected with wild-type were 0.50 NCU ml ! and 0.88 ng ml | b, PHHs infected with normalized 
HBV particles at the indicated viral-genome equivalents per cell (see stocks of wild-type (WT) or HBx-deficient (AX) HBV at the indicated viral 
Methods). HBeAg and surface antigen (HBsAg) secretion into the culture genome equivalents per cell were examined for HBsAg expression at 9 days 
supernatants was assessed 10 days later by ELISA. Concentrations are after infection by indirect immunofluorescence confocal microscopy. Cell 
expressed in, respectively, national clinical units (NCU) per millilitre nuclei were stained with Hoechst dye (blue). c, Quantification of confocal 
(HBeAg) and nanograms per millilitre (HBsAg) and were determined images. Data are mean + s.d. of at least three fields. Note that infection in 
according to the manufacturers’ guidelines (Autobio Diagnostics). Fig. 3 was with 200 viral genome equivalents per cell or more. 
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Extended Data Figure 6 | See next page for caption. 
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Extended Data Figure 6 | Silencing of Smc5/6 restores HBx-negative 
HBV transcription and rescues wild-type HBV on a DDB1 knockdown 
background. a, Biological replicate of Fig. 3a using PHHs from a different 
donor. b, Control for lentiviral shRNA-mediated depletion in PHHs. 
PHHs were transduced with lentiviral constructs expressing the indicated 
shRNAs and infected with HBV as in Fig. 3c. HBsAg secretion and the 
amounts of the indicated proteins were assessed 9 days later by ELISA 

and western blot analysis. HBsAg concentrations are relative to those 
measured in mock-transduced cells infected with wild-type HBV, which 
were taken as 100 (not shown). c, Independent northern blot analysis of 
HBV RNA production as in Fig. 3c using PHHs from a different donor. All 
lanes are from the same gel and exposure. Shown in the upper panel is the 
corresponding ELISA for HBeAg secretion. d, Smc5/6 silencing restores 
expression of HBx-negative HBV in HepaRG cells. Differentiated HepaRG 
cells were either left uninfected or infected with normalized stocks of 
wild-type (WT) or HBx-deficient (AX) HBV particles. Twenty-four hours 
later, cells were either mock transfected (none) or transfected with the 
indicated siRNA. HBeAg secretion was measured 10 days after infection 
by ELISA. The values are in arbitrary units relative to the wild-type HBV 


control, which was given a value of 100. Shown in the upper and lower 
panels are two independent experiments. CyPB, cyclophilin B. e, HBV 
mRNA expression. Total RNA was extracted from the samples analysed 
for HBeAg secretion in Fig. 3e. HBV mRNA levels were measured by 
real-time RT-PCR. The values normalized to B-actin are given in arbitrary 
units relative to those measured in mock-transfected cells, which were set 
to 100. Data represent the mean + s.e.m. of n= 4 independent experiments 
performed with two different PHH donors. f, Control of siRNA efficacy. 
DDBI (grey bars) and Smcé6 (black bars) mRNA levels in Fig. 3e were 
measured by real-time RT-PCR and normalized to 3-actin as in e. The 
values are given in arbitrary units relative to those measured in mock- 
transfected cells, which were set to 100. Data represent the mean + s.e.m. 
of n= 4 independent experiments performed with two different PHH 
donors. g, Control western blot analysis of siRNA-mediated protein 
depletion. PHHs were transfected with the indicated siRNAs 3 days after 
plating. Protein levels were measured 10 days later. h, Data of one of the 
four experiments used in Fig. 3e that includes an siRNA targeting Smc5. 
Mean +s.e.m. of triplicate measurements. i, Same experiment as in h but 
measuring HBV mRNA levels. Mean + s.e.m. of triplicate measurements. 
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Extended Data Figure 7 | HBV infection induces Smc6 degradation in b, Representative images of uninfected PHHs stained with mouse isotype- 
PHHs. a, Same as in Fig. 3b but 4 x 4 contiguous images were acquired control-matched IgG2a primary antibodies or Alexa Fluor 488- and 


and stitched together to produce a single image for examination of 594-conjugated secondary antibodies alone. Samples were imaged and 
200-300 PHHs. Shown are Smc6 (green), HBcAg (red), and DAPI (blue). processed as in Fig. 3b. 
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Extended Data Figure 8 | HBV infection induces Smc6 degradation in nuclear stain (blue) and with antibodies against human albumin (red) 
humanized mouse liver tissue. a, Same as in Fig. 4a but with different and human Smcé6 (green). In the middle and right panels, a yellow dotted 
animal samples. b, Specificity of detection of human Smcé6 in humanized line delineates the interface between the human (Hu) and mouse (Ms) 
mouse liver tissues. Fresh-frozen uninfected (animal 62) and infected hepatocyte populations. 


(animal 12) humanized mouse liver tissues were stained with DAPI 
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Extended Data Figure 9 | Human albumin and human cytokeratin-18 (CK-18, green). A yellow dotted line delineates the interface between the 
as markers for human hepatocytes in humanized mouse liver tissues. human (Hu) and mouse (Ms) hepatocyte populations in the third column 
Fresh-frozen uninfected (animal 54) or infected (animal 12) humanized from left. Shown on the right are control staining of liver tissue from 
mouse liver tissues were stained with DAPI nuclear stain (blue) and with animal 12 with secondary antibody only or IgG1 isotype control. 


antibodies against human albumin (red) and human cytokeratin-18 
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Sequence-dependent but not sequence-specific 
piRNA adhesion traps mRNAs to the germ plasm 


Anastassios Vourekas!*, Panagiotis Alexiou", Nicholas Vrettos!, Manolis Maragkakis! & Zissimos Mourelatos! 


The conserved Piwi family of proteins and piwi-interacting 
RNAs (piRNAs) have a central role in genomic stability, which 
is inextricably linked to germ-cell formation, by forming Piwi 
ribonucleoproteins (piRNPs) that silence transposable elements!. 
In Drosophila melanogaster and other animals, primordial germ- 
cell specification in the developing embryo is driven by maternal 
messenger RNAs and proteins that assemble into specialized 
messenger ribonucleoproteins (mRNPs) localized in the germ (pole) 
plasm at the posterior of the oocyte”. Maternal piRNPs, especially 
those loaded on the Piwi protein Aubergine (Aub), are transmitted 
to the germ plasm to initiate transposon silencing in the offspring 
germ line’. The transport of mRNAs to the oocyte by midoogenesis 
is an active, microtubule-dependent process®; mRNAs necessary 
for primordial germ-cell formation are enriched in the germ plasm 
at late oogenesis via a diffusion and entrapment mechanism, the 
molecular identity of which remains unknown®”. Aub is a central 
component of germ granule RNPs, which house mRNAs in the germ 
plasm!°!’, and interactions between Aub and Tudor are essential 
for the formation of germ granules!*-'!°, Here we show that Aub- 
loaded piRNAs use partial base-pairing characteristics of Argonaute 
RNPs to bind mRNAs randomly in Drosophila, acting as an adhesive 
trap that captures mRNAs in the germ plasm, in a Tudor-dependent 
manner. Notably, germ plasm mRNAs in drosophilids are generally 
longer and more abundant than other mRNAs, suggesting that they 
provide more target sites for piRNAs to promote their preferential 
tethering in germ granules. Thus, complexes containing Tudor, 
Aub piRNPs and mRNAs couple piRNA inheritance with germline 
specification. Our findings reveal an unexpected function for piRNP 
complexes in mRNA trapping that may be generally relevant to the 
function of animal germ granules. 

We performed ultraviolet crosslinking followed by stringent immu- 
noprecipitation (CLIP)!” for Aub (Fig. 1a) and standard small RNA 
immunoprecipitation, using a highly specific antibody that we gener- 
ated (Extended Data Fig. 1a) from wild-type (yw) ovaries and from yw 
and Tudor-null (tud) Drosophila embryos collected up to 2h after laying 
(0-2-h embryos); this is before zygotic transcription and degradation 
of maternal mRNAs. Crosslinked RNA-Aub complexes yielded strong, 
specific signals that were absent from non-immune serum and no- 
ultraviolet controls (Fig. 1a). CLIP and immunoprecipitation libraries 
contained essentially identical 23-29-nucleotide piRNAs (Fig. 1b, 
Extended Data Figs 1b-g and 2a-f and Extended Data Table 1). 
We verified minimal changes in the piRNA load of Aub in tud ver- 
sus yw ovaries’* (Extended Data Fig. 2g), and found no changes in the 
piRNA load of 0-2-h embryos compared to ovaries in both genotypes 
(Extended Data Fig. 2h, i). Larger CLIP tags (IgCLIPs, >36 nucleotides) 
are present in libraries prepared from larger RNP complexes (Fig. la—c, 
Extended Data Fig. 1d and Supplementary Results). 

We observe considerable overlap of retrotransposon lgCLIPs with 
complementary piRNAs (Extended Data Fig. 3a and Supplementary 


Table 1) and strong positive correlation of their abundances (Extended 
Data Fig. 3b, c). Relative distance analysis reveals high occurrence 
of lgCLIPs with a 10-nucleotide overlap to complementary piRNAs 
(Fig. 1d, peak at position +9) for all three genotypes. The majority 
of such lgCLIPs bear an adenine at the tenth position (Fig. le), and 
show prominent 5’—5’ end coincidence with Ago3 piRNAs (Fig. 1f), 
indicating that they correspond to ping-pong intermediate fragments 
produced by Aub slicing!. Furthermore, a second peak at position —15 
(Fig. 1d), which is 25 nucleotides (the median Aub piRNA length) from 
position +9, represents 5’ ends of fragments of trigger piRNA targets 
undergoing phased piRNA biogenesis’®. The above results indicate that 
CLIP captures piRNA biogenesis, complementary retrotransposon 
targeting and the transient products of Aub slicing activity (Fig. 1g). 

A large percentage (~50-66%) of lgCLIPs from all CLIP libraries 
are mRNA-derived (Fig. 1c and Extended Data Fig. 1g). Most Aub- 
bound mRNAs are not substrates for piRNA processing (Extended Data 
Fig. 4a). The Aub lgCLIP density is relatively high within 3’ untrans- 
lated regions (UTRs) compared to RNA sequencing (RNA-seq) ana- 
lysis, and overall lgCLIP abundance is not correlated with mRNA 
abundance (Extended Data Fig. 4b-d), suggesting specific target mRNA 
recognition. We cross-indexed Aub-bound mRNAs with the mRNA 
localization categories (compiled in ref. 19). Notably, posterior locali- 
zation categories are significantly enriched in all three sets of Aub CLIP 
libraries (embryo: yw and tud, ovary: yw) (Supplementary Table 2). 
Most importantly, we find 15 posterior and germ-cell localization 
categories significantly depleted, and ubiquitous mRNAs enriched in 
tud embryo compared to yw embryo CLIP libraries (Supplementary 
Table 3). Posteriorly localized mRNAs appear marginally upregulated 
compared to other localization categories in tud versus yw embryo 
RNA-seq libraries (two-sided t-test, P=0.01594), ruling out the pos- 
sibility that the reduced Aub binding is due to reduced posterior mRNA 
levels in tud embryos. Both Aub (Extended Data Fig. 1a) and germ 
plasm mRNAs'*” are uniformly distributed throughout tud embryos; 
therefore, the observed loss of binding specificity towards posterior 
mRNAs in the absence of Tudor can only be attributed to the disruption 
of the germ plasm. Thus, our experimental approach allows the iden- 
tification of the mRNAs specifically bound by Aub in the germ plasm, 
irrespective of the function of Aub in the clearance of maternal mRNAs 
in the somatic part of the embryo”!”. To identify the primary mRNA 
targets of Aub within the germ plasm during the formation of germ 
cells, we calculated the rank product of the normalized lgCLIP values 
for mRNAs in the 12 posterior localization categories marked with 
an asterisk in Supplementary Table 3, from three replicate yw embryo 
libraries (P <0.05). The list contains 220 genes, many of which appear 
enriched or selectively protected in germ cells'°, and with established 
roles in germ-cell specification and development such as cycB, nos, 
osk, gcl, pgc and Hsp83 (Supplementary Table 4). Characterization of 
Aub RNPs from early embryos provides independent support for the 
association of germ plasm mRNAs with Aub (Supplementary Results 
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and Extended Data Fig. 5). Four separate analyses provide strong evi- 
dence that the extent of the observed Aub binding of mRNAs cannot 
be explained by piRNA targeting of transposon sequences embedded 


Relative to 5’ of Ago3 piRNA Captured by CLIP 


in mRNAs (Supplementary Results and Extended Data Fig. 6). 


+1 


Complementarity events (%) 


To investigate the potential of piRNAs to direct Aub to complemen- 
tary mRNA sequences further, we analysed chimaeric lgCLIPs 
that each contain an intact piRNA, ligated with a sequence fragment 
(20 nucleotides) that is uniquely aligned on mRNAs (Fig. 2a and 
Supplementary Table 5). 'To uncover complementarity patterns, we 
implemented unweighted local alignment between the piRNA (in 
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Figure 1 | Transcriptome-wide identification of 
RNAs bound by Aub and in vivo retrotransposon 
targeting and slicing captured by CLIP. 

a, Aub CLIPs; separate libraries were prepared 
from RNA extracted from indicated positions. 
Uncropped gels can be found in Supplementary 
Fig. 1. kDa, kilodaltons; NRS, non-immune 
serum; UV, ultraviolet. b, Size distribution and 

5/ end nucleotide (nt) composition per size 

of CLIP tag. Error bars represent s.d.; n =3 
(biological replicates, the same applies to 

c, eand g). c, Genomic distribution of CLIP 

tags for three high yw embryo (0-2h) Aub 

CLIPs. d, Position of 5’ ends of retrotransposon 
lgCLIPs relative to 5’ ends of complementary 
piRNAs (0, x axis). e, Nucleotide composition 

at +9 of retrotransposon-derived lgCLIPs with 
10-nucleotide overlap to complementary piRNAs. 
f, yw ovary Aub lgCLIP 5’ end positions relative 
to the 5’ ends of Ago3-loaded piRNAs (0, x axis). 
g, Schematic of processing fragments captured by 
Aub CLIP. 


reverse complement orientation) and the mRNA fragment, scoring 
matches (+1), mismatches (—1) and indels (—2), and reporting the 
best alignment for every chimaeric read. The search was performed 


within +100 bases around the midpoint of the mRNA fragment; this 
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allows the identification of the entire complementary sequence that 
might be missing from the chimaeric fragment, and also provides a reli- 
able estimate of the signal-to-noise ratio. We observed prominent peaks 
of hundreds of thousands of complementarity events forming around 
the midpoint and within +25 nucleotides, in yw and tud embryo CLIP 
libraries (Fig. 2b, c). Most events score between 7 and 12; therefore, the 


Figure 2 | Complementarity analysis between 
the piRNA and mRNA parts of chimaeric 

CLIP tags. a, Strategy for chimaeric CLIP tag 
analysis, and genome browser illustrating Aub 
IgCLIPs on cycB; sequence and base pairing of a 
chimaeric CLIP tag is shown. Chr, chromosome. 
b, c, piRNA-mRNA complementarity events 
(percentage) within +100 bases from the 
midpoint of the mRNA part of the chimaeric 
read, plotted per alignment score for yw (b) and 
tud (c) embryo Aub CLIPs (biological triplicates). 
Percentage and number of total events occurring 
within +25 bases (dashed rectangles) are 

shown. Inset, per sample: bar chart of number of 
complementarity events per score group. d, Bar 
charts of average piRNA-mRNA complementarity 
events occurring within the +25-base window 

of the midpoint of the search area for indicated 
scores and mRNA localization categories and Aub 
CLIP libraries. All error bars denote s.d.; n= 3. 
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piRNA nucleotide position 


Figure 3 | Characteristics of piRNA base-pairing identified by 
chimaeric CLIP tag analysis. a, Heat maps showing base-paired 
nucleotides within the piRNA sequence, for all complementarity events 
(score >7) within +25-base window, for yw embryo and negative control. 
Stacked piRNAs are sorted (bottom to top) by: starting position and length 
of the longest stretch and total number of base-paired nucleotides. Every 
nucleotide position is coloured according to the length of the stretch of 


complementarity is not extensive. The distribution of the complemen- 
tarity events in the negative control (random piRNA) is completely flat 
across the search area and has lower scores (Extended Data Fig. 7a), sug- 
gesting that the chimaeric reads capture genuine sequence-dependent 
Aub-piRNA-mRNA contacts. 

piRNAs in chimaeric reads are typical Aub piRNAs (Extended 
Data Fig. 7b-e). pipRNA-mRNA complementarities with alignment 
score >7 congregate within a 50-nucleotide window (Fig. 2b-d), so 
we focused on events that have such scores and locations. piRNA 
complementarity towards posterior and non-posterior mRNAs is 
indistinguishable (Fig. 2d and Extended Data Fig. 7f), suggesting 
that the basis of mRNA binding preference by Aub is not sequence 
specificity. Chimaeric reads show substantial overlap (Fig. 2a) and the 
same enrichment in posterior-localized mRNAs with non-chimaeric 
lgCLIPs (Supplementary Tables 5 and 6), suggesting that they both 
capture the same RNA binding events. 

Base-paired nucleotides for every piRNA from three replicate 
CLIP libraries are summarized in a comprehensive plot (Fig. 3a and 
Extended Data Fig. 7g), revealing a bimodal distribution of the com- 
plementary regions within the piRNA. Many are found at the 5’ end of 
the piRNA, starting at positions 1 and 2 (reminiscent of miRNA seed- 
type binding); additional base-paired stretches start at positions 9-17 
(Fig. 3a, b). This pattern is absent from the negative control (Fig. 3a). 
Net density of base-paired nucleotides reveals a clear preference for 
piRNAs to use nucleotides at positions 2-6 with additional base pairs 
in positions 16-24 (Fig. 3c and Extended Data Fig. 7h, i). This profile 
is markedly similar in yw and tud libraries, and differs slightly from 
the miRNA hybridization profile” in the less frequent base-pairing 
in the 2-6 region, suggesting that piRNAs do not use a conserved 
seed sequence. The periodicity of the graph in Fig. 3c (Extended Data 
Fig. 7i) evokes the helical conformation and base-pairing availability of 
the small RNA in the context of an Ago-miRNA-target RNA tripartite 
complex’’, suggesting that despite the absence of a conserved seed, the 
mechanics of piRNA complementary binding are analogous to those 
of microRNAs. Analysis of the evolutionary conservation of paired, 
unpaired and flanking nucleotides on the mRNA sequence reveals 
that the piRNA-mRNA contact sites are not preferentially conserved 
(Fig. 3d). 

We used the local alignment approach by which we analysed the 
chimaeric CLIP tags, to identify potential piRNA target sites in the 
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position within the piRNA sequence. c, Average base-paired nucleotide 
density per position minus negative control (random piRNA). d, Average 
mRNA conservation score on and around piRNA-mRNA contact sites. 
All error bars denote s.d.; n= 3. 


D. melanogaster transcriptome. In 206,400,271 total sites, the vast 

majority (99.6%) are of scores 7-11 (Fig. 4a). Importantly, the den- 

sities of putative piRNA target sites on mRNA regions are essentially 
identical for mRNAs with or without posterior localization, and very 

similar to that of the chimaeric mRNA fragments (higher densities in 

the UTRs compared to the coding sequences; Fig. 4b, c and Extended 
Data Fig. 8). 

mRNAs in the 12 posterior localization categories are signifi- 

cantly longer than non-posterior localized mRNAs”* (Fig. 4d), and 
so contain a higher number of piRNA target sites (Fig. 4e); never- 

theless, transcript length normalization eliminates this difference 
(Fig. 4f, g). This holds true when the scores of the predicted sites 
are accounted for (Fig. 4g), and also when the scores are weighted 
for the preference of piRNA nucleotides 2-6 and 16-24 to base- 
pair (not shown). Posterior mRNAs are also more abundant than 
non-posterior; when factored in, this increases the difference of the 
target site abundance per transcript for the two localization categories 
(Fig. 4h). Posterior and non-posterior mRNAs are equally targeted 
(per kilobase) by each piRNA even when piRNA copy number is 
accounted for (Extended Data Fig. 9a). Notably, the size differen- 
tial (and not the absolute length) of posterior and non-posterior 
mRNAs is conserved among drosophilids: the intra-species size dif- 
ferential always favours posterior mRNAs, although non-posterior 
mRNAs from one species might be longer than the posterior mRNAs 
of another (Fig. 4i). Therefore, although piRNAs randomly base pair 
with non-conserved mRNA sequences, this mechanism is biased 
towards a specific class of mRNAs for germ plasm anchoring. 
Additionally, from the two categories of posterior localized mRNAs, 
‘localized’ and ‘protected’!°, localized mRNAs have longer 3’ UTRs 
than protected mRNAs, further supporting the notion that mRNA 
length positively affects germ plasm enrichment (Extended Data 
Fig. 9b, c). 

The concept of mRNA entrapment at the germ plasm during 
ooplasmic streaming is well established®*?’, but the mechanism at 
the molecular level has so far been elusive. We propose that germ 
plasm localized Tud—Aub-piRNA complexes play the role of a non- 
discriminatory adhesive trap that can form numerous, non-conserved 
piRNA-mRNA contacts to capture mRNAs and form germ plasm 
mRNPs (Fig. 4j and Supplementary Discussion). This mechanism 
probably shows preference for posterior mRNAs because they are 
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Figure 4 | Transcriptome-wide prediction of piRNA target sites 

and length differential of posterior-localized mRNAs. a, Number of 
predicted piRNA complementary sites on mRNAs, per score. b, c, Average 
binned density of: chimaeric mRNA fragments (Aub CLIP, yw embryo 
0-2h) along the meta-mRNA. CDS, coding sequence. Error bars denote 
s.d.; n= 3 (b); predicted piRNA complementary sites within all (14,058), 
posterior (380), and non-posterior (6,747) localized mRNAs (c). 

d-i, Box-and-whisker plots of: lengths of mRNAs expressed in yw 
embryos (0-2 h) (d); number of predicted piRNA complementary 

sites per mRNA (e); length-normalized number of predicted piRNA 
complementary sites (f); length-normalized total score of predicted 
piRNA complementary sites (g); number of predicted piRNA 
complementary sites per mRNA multiplied by the abundance of each 
mRNA RPKM (reads per kilobase per million mapped reads) (h); and 
lengths of orthologous mRNAs in other Drosophila species (i). Black 
lines denote median; white dots denote mean. ***P < 0.005, one-sided 
t-test (d); ***P<1 x 10~!%, one-sided Wilcoxon exact rank test (i). j, Aub 


significantly longer and more abundant”®. We believe that the above 
mechanism acts in addition to specific protein-protein, protein-RNA 
and RNA-RNA interactions that are necessary for mRNA transfer 
and anchoring to the posterior, and for translational control!%!778-9”, 
The multivalence of Aub-Tudor interactions probably contributes 
to the formation of multimeric germ granule complexes. We pro- 
pose that germ-cell specification and function by maternal mRNAs, 
and piRNA inheritance converge in Aub. Coupling germ-cell spec- 
ification with piRNA inheritance could be a strategy that increases 
reproductive fitness by ensuring the propagation of robust transposon 
silencing mechanisms to germ cells across generations and across 
the population. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 


PGC specification 
and function by germ plasm 


Early embryo 


couples piRNA inheritance with germ-cell specification in Drosophila. 
Aub, carrying symmetrically dimethylated arginine residues (sDMAs) 
dimethylated by Csul methyltransferase, interacts with Tudor, and both 
are localized in the germ plasm during mid-stage oogenesis. Ooplasmic 
streaming at later stages promotes diffusion of mRNPs, facilitating 
random contacts of mRNAs with the germ plasm. Aub piRNAs 

form an adhesive trap that captures mRNAs forming numerous low 
complementarity contacts. mRNAs with posterior functions are longer 
and more abundant than the rest, form more piRNA-mediated contacts 
with the germ plasm, and thus their entrapment is enhanced. Tudor- 
Aub-piRNA-mRNA complexes along with other RNA binding proteins 
form germ granules that contain both piRNAs and mRNAs that induce 
primordial germ cell (PGC) specification. Aub and its RNA cargo is 
incorporated in PGCs, providing the maternal mRNAs that are necessary 
for PGC function and the maternal piRNAs that will propagate an RNA 
immune response against transposons. 
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METHODS 


Drosophila strains, tissue collection. The following strains and heteroallelic 
combinations were used: yw!" as the wild-type stock (yw), aub#N*/2°” (qub) 
and tud'/PI(2R)PurP133 (tud), for aub and tud mutants (loss-of-function), respec- 
tively'°3*3. All flies were grown at 25°C with 70% relative humidity on a 12-h 
light-dark cycle. The 24-day female flies were crossed to yw males for 2 days in 
standard cornmeal food supplied with yeast paste before ovary dissection. Embryos 
collected at well-defined time-windows were dechorionated in 50% commercial 
bleach for 2 min, washed extensively in water and collected in PBS or HBSS or 
fixation solution, depending on downstream applications. 

Antibodies. Antibody against Aubergine (Aub-83) was produced by immu- 
nizing rabbits with Aub peptide (HKSEGDPRGSVRGRC, in which terminal 
cysteine was used to couple to KLH; Genscript) and selected with peptide- 
affinity purification of sera. Other antibodies that were used in this study: mouse 
monoclonal anti-PABP (6E2 clone)**, E7 mouse monoclonal anti-$-tubulin 
(Developmental Studies Hybridoma Bank) and anti-Tudor mouse monoclonal 
(gift from M. Siomi). 

Immunofluorescence. Fixation and immunohistochemistry of dissected ovaries 
and embryos was performed according to standard protocols. Primary antibodies 
against Aub and Tud were used at 1 ng! final concentration. Secondary anti- 
bodies conjugated to Alexa 488 and 594 (Life technologies) were used at 1:1,000 
dilution. Ovary and embryo samples were imaged on Leica TCS SPE confocal 
microscope. 

Aub CLIP-seq (HITS-CLIP, high-throughput sequencing after crosslinking 
and immunoprecipitation). CLIP was performed as previously described for 
Mili, Miwi and MOV10L1I (refs 17, 35, 36). The protocol is described in detail 
previously*® and uses stringent buffer conditions to ensure high specificity. The 
experiment was performed in three biological replicates for each condition (yw 
ovaries, yw embryos 0-2 h, tud embryos 0-2h). Approximately 40 mg of Drosophila 
embryos (0-2h) or ~80 ovaries from 4—6-day females were collected in ice-cold 
HBSS and ultraviolet-irradiated (3 x) at 254nm (400 mJ cm~’). The tissues were 
pelleted, washed with PBS and the final tissue pellet was flash-frozen in liquid 
nitrogen and kept at —80°C. Ultraviolet-light-treated tissues were lysed in 350 il 
PMPG (PBS (no Mg?* and no Ca?*), 2% Empigen) with protease inhibitors and 
rRNasin (2 Upl7!) and no exogenous RNases; lysates were treated with DNase I 
(Promega) for 5 min at 37°C, and then were centrifuged at 100,000g for 30 min 
at 4°C. 

For each immunoprecipitation, approximately 101] of our anti-Aub anti- 
body was bound on 150 1 (slurry) of protein A Dynabeads in Ab binding buffer 
(0.1M Na-phosphate, pH 8, and 0.1% NP-40) at room temperature for 2h; 
antibody-bound beads were washed three times with PMPG. Antibody beads 
were incubated with lysates (supernatant of 100,000g) for 3h at 4°C. Low- and 
high-salt washes of immunoprecipitation beads were performed with 1x and 
5x PMPG (5x PBS, 2% Empigen). RNA linkers (RL3 and RL5), as well as 
3’ adaptor labelling and ligation to CIP (calf intestinal phosphatase)-treated RNA 
CLIP tags were performed as previously described”. 

Immunoprecipitation beads were eluted at 70°C for 12 min using 301 of 2x 
Novex reducing loading buffer. Samples were analysed by NuPAGE (4-12% gra- 
dient precast gels, run with MOPS buffer). Cross-linked RNA-protein complexes 
were transferred onto nitrocellulose (Invitrogen), and the membrane was exposed 
to film for 1-2h. Membrane fragments containing the main radioactive signal 
and fragments up to ~15 kDa higher were excised (Fig. 1a). RNA extraction, 
5/ linker ligation, Reverse transcriptase PCR (RT-PCR) and a second PCR step 
were performed with the DNA primers (DP3 and DP5, DSFP3 and DSFP5) as 
described previously*°. Complementary DNA from two PCR steps was resolved 
on and extracted from 3% Metaphor 1x TAE gels. Size profiles of cDNA libraries 
prepared from the main radioactive signal and higher molecular mass signal 
were similar (Fig. la). DNA was extracted with QIAquick Gel Extraction kit and 
submitted for deep sequencing. The cDNA libraries were sequenced with Hi-Seq 
Illumina at 100 cycles. 

Solid-support directional RNA-seq. Solid-support directional RNA-seq was 
performed as previously described!’, using total RNA (depleted of ribosomal 
RNA with Ribo-Zero (EpiCentre)) isolated from 0-2-h embryos of appropriate 
genotypes. 

Nycodenz density gradient ultracentrifugation and subsequent analyses. 
Nycodenz density gradient separation of RNPs was performed as previously 
described!” with modifications. A 20-60% (top to bottom) Nycodenz gradient 
(4.8ml) in 1x KMH150 (150mM KCl, 2mM MgCl, 20mM HEPES, pH 7.4, 0.5% 
NP-40, 0.1 Up! rRNasin, and protease inhibitors) was prepared as a step gradient 
by overlaying five equal parts of Nycodenz solutions and was left to diffuse over- 
night at 4°C. 0.2 microlitres of post-nuclear yw embryo lysate in 1x KMH150 was 
laid over the gradient and centrifuged at 150,000g for 20h. We used embryos of 
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stages 4-6, to avoid earlier stages where mRNAs at the soma form distinct mRNPs 
than the ones formed in the pole plasm PGCs. The gradient was collected in 12 
equal fractions. Samples from each fraction were used for protein determination by 
Bradford and RNA extraction with Trizol LS. Right before RNA extraction, 500 ng 
of in vitro transcript of Renilla luciferase mRNA was spiked in each fraction for 
normalization purposes in subsequent steps. 

qRT-PCR. An equal volume of RNA extracted from each fraction was reverse 
transcribed by Supersript II (Invitrogen 18080-051) in the presence of 
random hexamers. Equal volume of the cDNA was mixed with primers 
(gcl, osk, Hsp83, dhd, cycB: Qiagen QuantiTect Assay; Renilla luciferase 
(rLuc), forward: 5'-CGCTGAAAGTGTAGTAGATGTG-3’ and reverse: 
5'/-TCCACGAAGAAGTTATTCTCCA-3’) and Power SYBR Green reaction mix 
(Applied Biosystems 4367659). The reactions were run on a StepOnePlus System 
(Applied Biosystems) using the default program. 

Immunoprecipitation and detection of piRNAs, and preparation of cDNA 
libraries. Aub immunoprecipitation, 5’ end labelling of piRNAs and cDNA library 
preparation were carried out as previously described*”**. 

Code availability. We used CLIPSeqTools*, a bioinformatics suite that we created 
for analysis of CLIP-seq data sets (accessible at: http://mourelatos.med.upenn.edu/ 
clipseqtools/) and a Perl programming framework that we developed”. The latter 
framework is named GenOO and has been specifically developed for analysis of 
high-throughput sequencing data. The source code for GenOO has been deposited 
in GitHub and can be accessed at https://github.com/genoo/. 

Statistics. In statistical analyses, we ensured that the assumptions of each statistical 
test are met and that the statistical test used is appropriate for the analysis. In all 
analyses the statistical tests and methods used are clearly stated in relevant sections. 
No statistical methods were used to predetermine sample size. The experiments 
were not randomized and investigators were not blinded to allocation during 
experiments and outcome assessment. 

Data. Drosophila (assembly dm3) transcript, exon and repeat genomic locations 
were downloaded from the UCSC genome browser (downloaded 22 March 2011 
from http://genome.ucsc.edu). Repeat consensus sequences were downloaded from 
Flybase (http://flybase.org/ - transposon_sequence_set v9.42). Localization cate- 
gories for Drosophila genes were taken from ref. 19. The localization annotation 
matrix was downloaded from (http://fly-fish.ccbr.utoronto.ca/annotation_matrix. 
csv). Transposon categories were as in ref. 31. 

Preprocessing. The 3’ end ligated adaptor (GTGTCAGTCACTTCCAGCGGTC 
GTATGCCGTCTTCTGCTTG) was removed from the sequences using the cut- 
adapt software and a 0.25 acceptable error rate for the alignment of the adaptor on 
the read. To eliminate reads in which the adaptor was ligated more than one time, 
adaptor removal was performed three times. 

Alignment. Reads for all samples were aligned against the dm3 Drosophila 
melanogaster genome assembly using the aligner bwa v0.6.2-r126, with the default 
settings”). Reads were also aligned against the Repeat consensus sequences using 
the same aligner. 

Genomic distribution. All mapped reads were divided in the following genomic 
categories: sense repeat, antisense repeat, non-coding RNA, (protein) coding RNA. 
The remaining reads were considered to be intergenic reads. 

Correlation of replicates. Gene expression was defined as the number of reads 
that map on each gene and the values were normalized by the upper quartile nor- 
malization method”. The log, gene expression levels of replicates are compared 
using the Pearson Correlation function in R. 

Coincidence with immunoprecipitation. Reads mapping in the same position 
(same 5’ end mapping) were considered as coinciding. When comparing CLIP 
with immunoprecipitation libraries, the percentage of piRNA-size CLIP reads that 
had a coinciding start with any standard immunoprecipitation read were counted 
as positive. 

Significant localization. For each localization category, the quartile-normalized 
lgCLIP binding level (mRNA expression level’ in each CLIP library) is compared 
via a two-sided t-test between genes that belong to the category versus genes that 
do not belong to it. To compare two samples, we measure the difference in binding 
(per gene) between the two conditions (log>(gene.expr.cond1/gene.expr.cond2)) 
and then perform a t-test of differences in genes belonging to the category versus 
genes not belonging in the category. 

Early embryo posterior localization categories. The following twelve mRNA 
localization categories!’ were found significantly depleted in tud embryo Aub 
CLIP libraries compared to yw embryo libraries, and were used in analyses were 
‘posterior localized mRNAs are mentioned: ‘1:41:RNA islands’ “1:42:Pole buds, 
‘1:40:Pole plasm, ‘3:265:Perinuclear around pole cell nuclei’ “4:370:Germ cell local- 
ization, “4:403:Germ cell enrichment’, ‘3:348:Pole cell enrichment; ‘2:141:Pole cell 
localization, ‘2:153:Perinuclear around pole cell nuclei’, ‘2:142:Pole cell enrich- 
ment; ‘3:347:Pole cell localization, ‘1:59:Perinuclear around pole cell nuclei’ 
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(http://fly-fish.ccbr.utoronto.ca/). The remaining mRNAs are mentioned as 
non-posterior localized mRNAs. The following three posterior localization cat- 
egories were also depleted in tud embryo Aub CLIP libraries compared to yw: 
‘1:39:Posterior localization, ‘2:124:Posterior localization, “3:352:Posterior locali- 
zation. Almost all of the mRNAs contained in the above twelve categories are also 
contained in these three, but these three categories also contain some mRNAs that 
do not actually localize in the pole plasm or the germ cells (that is, with apical local- 
ization); therefore, mRNAs belonging in any of these three localization categories 
but not in any of the above mentioned twelve posterior categories were not consid- 
ered for the generation of the Supplementary Table 4. Many mRNAs do not have 
a designated localization pattern, and they are mentioned as ‘undetermined 
localization. It is worth mentioning that this category contains several mRNAs 
with clear posterior—pole-plasm localization. Through manual searches of the 
Berkeley Drosophila Genome Project chromogenic ISH database (http://insitu. 
fruitfly.org/cgi-bin/ex/insitu.pl) we noticed that many Aub-bound mRNAs, the 
localization of which is not annotated in the Fly-FISH database, are indeed local- 
ized in the germ plasm/cells (such as CG4735/shu, CG7070/PyK, CG4903/MESR4, 
CG5452/dnk and CG9429/Calr), therefore our analysis is most likely underesti- 
mating the true number of Aub-bound mRNAs that are important for germline 
specification and function. Because of this, mRNAs with ‘undetermined locali- 
zation’ were never mixed with ‘non-posterior localized’ mRNAs in our analyses. 
Highly bound genes. To identify highly bound genes, we used the rank prod- 
uct method”. Specifically, genes are sorted by expression per sample, and for 
each gene the product of their ranks is calculated. The probability of this rank 
product produced by chance is calculated by permutations of all non-zero 
value genes. 

Transcript expression calculation. We calculated the expression for protein- 
coding transcripts by counting the number of RNA-seq reads that map within the 
exons of each transcript. The counts were normalized using RPKM and upper 
quartile normalization, effectively dividing each count by the upper quartile of all 
counts”. The transcript with the highest RPKM score was used (‘best transcript’) 
unless otherwise noted. 

Transcript Aub-binding calculation. We calculated the expression for protein- 
coding transcripts by counting the number of CLIP reads that map within the 
exons of each transcript in the sense orientation. The counts were normalized using 
reads per million and upper quartile normalization, effectively dividing each count 
by the upper quartile of all counts’”. 

RNA-seq correlation versus CLIP. Upper quartile normalized RPKM for RNA- 
seq was compared to similarly normalized CLIP binding levels defined as average 
number of reads per transcript in CLIP replicates. Correlation was calculated using 
the Pearson Correlation function in R. 

Identification of hybrid reads. (1) Identified lgCLIP size reads (read length >35) 
that did not align to the genome. (2) Made a set of substrings from both ends of 
reads from (1) of piRNA size (L = [23,29]). (3) Identified the substring from (2) 
to full-length piRNAs (L = [23,29]) from corresponding low samples (Extended 
Data Fig. 1b) (4) The longest aligning piRNAs are retained and coupled with the 
remainder of the read as piRNA-lgCLIP couples. (5) The piRNA aligning fragment 
is cut from the read. Very small remainder reads (L = [<20]) are discarded. (6) The 
remainders are aligned to the genome (using bwa default settings). (7) Remainders 
aligned in one single position that is on a known mRNA are retained. 
Alignment of piRNAs to regions. (1) Regions of 200-nucleotide length were cut 
around the midpoint of the genomic alignment region from step 7 of previous 
routine. Specifically, if (d= 200 the length of the final region we want and L is the 
length of the read), a genomic region flanking the read on each side of length d/2 
was excised from the chromosome sequence. If the alignment was located in the 
minus strand the sequence was reversed and complemented at this point. This 
total region has length d+ L. We discard an equal number of nucleotides from each 
side to reach a final length of L (specifically we substring starting from int(L/2) 
and for d nucleotides. Note, int will always round down). At this point we have 
a region of length 200 nucleotides centred around the alignment region of the 
fragment. (2) We use a slightly modified Smith-Waterman“ alignment method 
(weights: match = +1, mismatch = —1, gap = —2) to align piRNAs on the 200-nucle- 
otide long regions from (1). Differences of our alignment versus Smith-Waterman: 
(a) No penalties are given to non-matching nucleotides on the edges of the align- 
ment. (b) If there are multiple optimal alignment scores, one is picked randomly. 
(c) Alignments in which part of one sequence is outside the boundaries of the other 


sequence are not considered. (3) The midpoint of the alignment (if k nucleotides 
matched that is the int(k/2) nucleotide) is used for graphs of alignment positioning 
on regions. 

mRNA target prediction for the top 2,000 expressed piRNAs. We grouped 
piRNA sequences into families based on the first 23 nucleotides of each piRNA. 
Using the alignment algorithm described above we aligned one piRNA (the most 
abundant) for each of the top 2,000 families to the longest annotated transcript for 
each protein-coding gene. These 2,000 piRNA families represent ~37% of piRNA 
reads from low yw CLIP libraries. To factor in transcript abundance, we multiplied 
the RNA-seq (yw 0-2-h embryo) RPKM value for each mRNA with the number of 
predicted piRNA target sites found within the mRNA. This provides a ‘targeting 
potential’ of every mRNA species, corrected for its abundance. 

We then evaluated the targeting potential of each piRNA-mRNA pair using 

three different scoring schemes. For the first, we sum the alignment score of all 
putative piRNA binding sites on the mRNA. For the second, we calculated a 
weighted alignment score for each putative piRNA binding site and then we sum 
all scores similar to the previous scheme. The weighted score for each binding site 
is calculated based on the following formula >, x; *.A; , in which x; is 1 or 0 based 
on whether the nucleotide at position i of the piRNA is bound or not, and A; is the 
weight for nucleotide. For the third, we multiplied the total number of predicted 
complementary sites per piRNA, with the piRNA copy number. 
Study of the lengths of D. melanogaster orthologous mRNAs in other Drosophila 
species. Transcript sequences (fasta file) for each species were downloaded from 
Flybase (ftp://ftp.flybase.net/genomes/ on 1 September 2015, current version 
used for each genome). For each gene (identified as the ‘parent’ tag in the fasta 
file header), the longest transcript length was identified. For the analysis of the 
expressed mRNAs (Fig. 4d), we used our yw embryo RNA-seq data to identify 
the longest transcript with the highest length normalized abundance. Orthologue 
gene tables were downloaded from Flybase (gene_orthologs_fb_2015_03.tsv.gz) 
and were used to identify orthologue genes across species. For each species, all 
genes that mapped to localized and unlocalized Drosophila melanogaster genes 
were used in the comparison and were assigned to the corresponding group as 
their D. melanogaster orthologue. Boxplots were created using the lattice package 
in R (bwplot) and omitting outliers, P values were calculated using the Wilcoxon 
exact rank test (wilcox.test in R) one-sided with the hypothesis that localized genes 
are longer than non-localized. 
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Extended Data Figure 1 | Endogenous Aub localization in genotypes 
used, sequenced and mapped reads of CLIP sequencing and RNA 
immunoprecipitation libraries used in this study, and general 
characteristics of yw ovary and tud embryo (0-2h) CLIP sequencing 
libraries. a, Immunofluorescence of ovary and early embryo of indicated 
genotypes using antibodies against Aub (Aub-83; green) and Tudor (red), 
and schematic representation of the egg chamber. Aub is localized in 

the nuage and germ (pole) plasm of wild-type ovaries, in the germ plasm 
of early wild-type embryos (stage 2) and within PGCs as they form in 

the posterior pole (stage 5), and as they migrate during gastrulation 
(stage 10). Tudor colocalizes with Aub in the germ plasm of early 
embryos but is not detected after PGC formation. In Tudor mutant early 
embryos, Aub is not concentrated in the posterior but is diffusely present 
throughout the embryo; PGCs are never specified resulting in agametic 
adults (see also Extended Data Fig. 9). b, Sequenced and mapped reads 


of CLIP sequencing (CLIP-seq) libraries prepared in this study. 

c, Sequenced and mapped reads of RNA immunoprecipitation deep- 
sequencing libraries prepared in this study. d, Size distribution for the 
three low (one for tud) and three high yw ovary and tud embryo (0-2 h) 
Aub CLIP-seq libraries. The size range of piRNAs (23-29 nucleotides) is 
indicated by a dashed box. e, Average 5’ end nucleotide composition for 
piRNAs (23-29 nucleotides) from three low yw ovary, tud embryo (0-2 h) 
(one library) and yw embryo (0-2h) Aub CLIP-seq libraries. f, Average 5’ 
end nucleotide composition of CLIP tags from three high yw ovary and 
tud embryo (0-2h) Aub CLIP-seq libraries. piRNAs (23-29 nucleotides) 
are indicated by a dashed box. g, Genomic distribution of CLIP tags for 
three high yw ovary and tud embryo (0-2h) Aub CLIP-seq libraries. 
Overlap of piRNAs from CLIP and immunoprecipitation libraries. All 
error bars denote s.d.; n=3. 
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Extended Data Figure 2 | Pairwise comparisons of transposon piRNA 
populations from various libraries. a—c, Scatterplot comparison of 
normalized abundance of piRNAs mapped on consensus retrotransposon 
sequences (sense and antisense), from yw embryo (0-2 h) standard Aub 
immunoprecipitation and Aub CLIP libraries (a); from yw ovary libraries 
(b); and from tud embryo (0-2h) libraries (c). Pearson correlation is 
shown for all elements in each plot. Retrotransposon categories are set 

as in ref. 31. d—f, Scatterplot comparison of normalized abundance of 
transposon-derived piRNAs in Aub CLIP libraries prepared from higher 
molecular mass signals (high; Fig. la, marked with a light blue line), with 
the piRNAs found in the libraries prepared from the main radioactive 
signal (low; Fig. la, marked with a dark blue line) from yw embryo 


(0-2 h) (d); from yw ovary Aub CLIP ‘high and ‘low libraries (e); and 
from tud embryo (0-2h) Aub CLIP high and low libraries (f). These 
comparisons indicate that the piRNA loads in low and high CLIP 

libraries are essentially identical. g, Scatterplot comparison of normalized 
abundance of transposon-derived piRNAs for yw ovary and tud ovary Aub 
immunoprecipitation libraries, to evaluate changes of piRNA load in the 
absence of Tudor. While antisense-derived piRNAs are largely unchanged, 
a few sense-derived piRNAs are changed (blood retrotransposon is 
indicated). h, i, Scatterplot comparison of normalized abundance of 
transposon-derived piRNAs for yw ovary and yw embryo (0-2h) Aub 
immunoprecipitation libraries (h); and for tud ovary and tud embryo 
(0-2 h) libraries (i). 
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Extended Data Figure 3 | Retrotransposon targeting by 
complementary piRNAs identified by Aub CLIP. a, Overlap of 

IgCLIPs with complementary piRNAs from CLIP libraries, mapping on 
retrotransposons. b, c, Scatterplots of normalized abundance of antisense 
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ized) 


piRNAs and sense lgCLIPs (b) and for sense piRNAs and antisense 
lgCLIPs (c) mapped on retrotransposons for the indicated Aub CLIP 
libraries. Pearson correlation is shown for all elements in every plot. 
Retrotransposon categories are set as in ref. 31. 
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Extended Data Figure 4 | See next page for caption. 
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Extended Data Figure 4 | CLIP identifies extensive mRNA binding by 
Aub. a, Ratio average plot of normalized (reads per million, RPM) Aub 
CLIP tag (pi, piRNA; lg, lIgCLIP) abundance (A value) versus lgCLIPs 
over piRNA abundance (R value), for all mRNAs. Outlined circles (red) 
correspond to genes that belong in the 12 posterior localization categories 
depleted in tud versus yw Aub CLIP libraries. Zero values are substituted 
with a small (smallest than the minimum) value so that log calculations 
are possible. This graph strongly suggests that mRNA binding by Aub as 
captured by CLIP is not for piRNA biogenesis purposes. b, Sequenced 
and mapped reads of RNA-seq libraries prepared in this study. c, Density 
of Aub CLIP-seq tags (yw embryo, and bottom panel: tud embryo) and 
RNA-seq reads (top panel: yw embryo) within the UTRs and coding 


LETTER 


sequences of the meta-mRNA. Each mRNA region is divided in 30 bins, 
and the number of the chimaeric mRNA fragments (genomic coordinate 
of the mRNA fragment midpoint) mapped within each bin is counted. 
Error bars indicate one s.d., n =3 for CLIP-seq; minimum and maximum 
values for the two RNA-seq replicate libraries. d, Scatterplot of average 
normalized mRNA abundance for yw embryo RNA-seq (RPKM) and Aub 
CLIP-seq (RPM). Aub highly bound mRNAs with posterior localizations 
(Supplementary Table 4) are marked with a red circle. Zero values are 
substituted with a small (smallest than the minimum) value so that log 
calculations are possible. CLIP-seq identifies mRNAs that span the whole 
expression range of RNA-seq libraries, indicating that Aub CLIP does not 
capture transcripts simply based on abundance. 
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Extended Data Figure 5 | Partial purification of Aub RNPs from early 
embryo supports binding of germ plasm mRNAs by Aub. 

a, Fractionation of isopycnic Nycodenz density gradients of post-nuclear 
yw embryo lysate. Protein and Nycodenz concentration for every fraction 
is plotted. b, Western blot detection of indicated proteins in gradient 
fractions. A short and a long exposure (exp.) for Aub is shown. Uncropped 
gels for b, d and e can be found in Supplementary Fig. 1. c, Heat map of 
levels of indicated germ plasm mRNAs determined by quantitative 
RT-PCR (qRT-PCR), normalized to spiked luciferase RNA, and with 


fraction 2 as a reference. d, Western blot detection of Aub in indicated 
diluted Nycodenz fractions used for Aub RNA immunoprecipitation. 

e, Electrophoretic analysis on denaturing polyacrylamide gel of 
*2P_labelled small RNAs immunoprecipitated with Aub from indicated 
gradient fractions. A bracket denotes piRNAs, detected primarily in 
fractions 6 and 7 (asterisk denotes 2S rRNA). f, Bar chart showing fold 
enrichment (over fraction-extracted total RNA) of indicated germ plasm 
mRNAs in Aub immunoprecipitations from gradient fractions, measured 
by qRT-PCR. Luciferase mRNA was used as a spike. 
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Extended Data Figure 6 | See next page for caption. 
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Extended Data Figure 6 | Analysis of Aub CLIP tags mapping to mRNAs 
with regard to the presence of mRNA embedded transposons. a, Overlap 
of lgCLIPs with complementary piRNAs from CLIP libraries, mapping on 
mRNAs. b, Scatterplot of yw embryo Aub lgCLIPs mapped in the sense 
orientation on mRNAs, with piRNAs mapped in the antisense orientation. 
Zero values are substituted with a small (smallest than the minimum) 
value so that log calculations are possible. Contrary to retrotransposons 
(Extended Data Fig. 3), there is no correlation, suggesting that extensive 
piRNA complementarity cannot explain the widespread mRNA binding 
shown by mRNA lgCLIPs. c, Scatterplot of yw embryo Aub lgCLIPs 
mapped in the sense orientation on mRNAs with per base (nucleotide) 
mRNA embedded retrotransposons (LINE, long terminal repeat (LTR), 
satellite). Posterior, non-posterior and undetermined localizations 

are marked as indicated. The graph is separated into four quadrants: 
clockwise from bottom left corner: 0 embedded repeats, 0 CLIP tags; 0 
embedded repeats, >0 CLIP tags; >0 embedded repeats, >0 CLIP tags, 

>0 embedded repeats, 0 CLIP tags. The number of genes in the four 
quadrants is indicated. Zero values are substituted with a small (smallest 


than the minimum) value (different small value for every localization 
category was used for clarity) so that log calculations are possible. This 
graph suggests that there is no correlation between the numbers of CLIP 
tags and embedded repeats within the mRNAs. d, Aub lgCLIPs density 
surrounding (+200 bases) mRNA-embedded retrotransposons (LINE, 
LTR, satellite as indicated). This analysis shows that there is no increase in 
the lgCLIP density in the areas flanking embedded repeats, suggesting that 
repeat sequences are not used as enriched target areas for mRNA binding 
by Aub. Error bars denote s.d.; n =3.e, Analysis of mRNA expression level 
in relation to the number of embedded repeats. The number of embedded 
repeats per nucleotide of exon was plotted with the ratio (logio) of mRNA 
expression in yw embryo (0-2h) versus aub#X7/2C”? embryo (0-2 h) (left), 
and yw embryo (0-2h) versus tud embryo (0-2h) (right). The mRNAs 

are divided into groups based on the number of embedded repeats. The 
number above each data point denotes the number of mRNAs in each 
group. The graphs suggest that there is no proportional or consistent 
abundance change, decrease or increase, with the number of embedded 
repeats. 
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Extended Data Figure 7 | See next page for caption. 
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Extended Data Figure 7 | Characteristics of piRNAs and piRNA base- 
pairing with complementary target sites identified from analysis of 
chimaeric CLIP tags. a, piRNA-mRNA complementarity events for 

a random piRNA (negative control, average of three yw (top) and tud 
(bottom) embryo (0-2h) samples), within +100 bases from the midpoint 
of the mRNA part of the chimaeric read. Complementarity events are 
plotted per alignment score group as indicated, for clarity. Inset (per 
sample): bar chart of average complementarity events per score group. 

b, Size distribution of the piRNAs identified within chimaeric CLIP tags, 
for yw and tud embryo CLIP libraries. Only the piRNAs implicated in 

the complementarity events occurring within +25 nucleotides from the 
midpoint of the mRNA fragment and with score >7 are analysed in this 
graph, and the graphs in c-e, g-i. c, 5’ end nucleotide preference for the 
piRNAs identified within chimaeric CLIP tags, for yw and tud embryo Aub 
CLIP libraries. d, Genomic distribution for the piRNAs identified within 
chimaeric CLIP tags, for yw and tud embryo Aub CLIP libraries. e, Per 
position nucleotide preference for all piRNAs in Aub yw embryo (0-2 h) 
CLIP library L3 (left), and for the piRNAs identified within chimaeric 


CLIP tags, for yw and tud embryo Aub CLIP libraries. f, Complementarity 
events between piRNAs and mRNA fragments of chimaeric reads, for 
posterior and non-posterior localized mRNAs (yw embryo). The plots are 
separated per score group. g, Heat maps showing base-paired nucleotides 
of piRNAs for all complementarity events identified within chimaeric 
CLIP tags (events occurring within +25 nucleotides from mRNA fragment 
midpoint, score >7) for tud embryo. Colour is according to the length of 
the consecutive stretch of base-paired nucleotides that runs over every 
position (colour code shown on the right). Stacked piRNAs are aligned at 
their 5’ ends and sorted (bottom to top) following these rules: (a) starting 
position of the longest stretch of consecutive base paired nucleotides, 
relative to the piRNA end; (b) length of longest base-paired stretch; (c) 
total number of base-paired nucleotides. h, Base-pairing frequency along 
the piRNA length for yw embryo libraries (blue) and their negative control 
(red). i, Net base-pairing frequency along the piRNA length (red) and net 
density of base paired nucleotides (grey) in mRNAs from chimaeric CLIP 
tags from tud embryo libraries. All error bars denote s.d.; n= 3. 
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Extended Data Figure 8 | See next page for caption. 
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Extended Data Figure 8 | Non-chimaeric Aub CLIP tag (IgCLIP), 
chimaeric mRNA fragment and RNA-seq read density along the 
untranslated and coding sequences of mRNAs. a, Average density of 
chimaeric mRNA fragments (Aub CLIP, yw 0-2-h embryo) along the three 
parts of the meta-mRNA. Each mRNA region is divided in 30 bins and 
the number of the chimaeric mRNA fragments (genomic coordinate of 
the mRNA fragment midpoint) mapped within each bin is counted. Inset: 
bar plot showing cumulative density in each mRNA region. b, Average 
density of the chimaeric mRNA fragments on mRNA regions; mRNAs are 
separated into three localization groups as indicated: posterior localized 
(12 categories; Supplementary Table 3), non-posterior and undetermined 
localization. Inset: bar plot showing cumulative density in each mRNA 


region. c, As in a for chimaeric mRNA fragments from Aub CLIP libraries, 
tud embryo (0-2h). d, As in b for chimaeric mRNA fragments from Aub 
CLIP libraries, tud embryo (0-2 h). e, As in a for non-chimaeric lgCLIPs 
from Aub CLIP libraries, yw embryo (0-2h). f, As in b for non-chimaeric 
lgCLIPs from Aub CLIP libraries, yw embryo (0-2h). g, As ina for non- 
chimaeric lgCLIPs from Aub CLIP libraries, tud embryo (0-2h). h, As inb 
for non-chimaeric lgCLIPs from Aub CLIP libraries, tud embryo (0-2h). 
i, As in a for RNA-seq reads, yw embryo (0-2h). j, As in b for RNA-seq 
reads, yw embryo (0-2h). k, As in a for RNA-seq reads, tud embryo 
(0-2h).1, As in b for RNA-seq reads, tud embryo (0-2 h). Error bars 
denote s.d.;n=3. 
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Extended Data Figure 9 | See next page for caption. 
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Extended Data Figure 9 | Lengths of posterior localized mRNAs in 
Drosophila species; characteristics of embryos used in our studies. 

a, Box-and-whisker plot of the number of predicted piRNA target sites 
(per kilobase of mRNA sequence) for every mRNA-piRNA pair, multiplied 
by the piRNA copy number. Posterior and non-posterior mRNAs are as 
indicated. Black lines denote the median. This graph indicates that the 
‘targeting potential’ (number of predicted complementary sites multiplied 
by the piRNA copy number) of each piRNA against each mRNA is the 
same for the two localization categories, suggesting that the piRNA 

copy number is not a contributing factor for the observed preference of 
posterior localized mRNAs for piRNA adhesion. b, Box-and-whisker 

plot of the lengths of D. melanogaster mRNAs (and their 5’ UTR, coding 
sequences and 3’ UTR parts) that are found in the enriched and protected 
categories, as defined previously’. Black lines denote the median; 

white dots denote the mean. n.s., not significant (P > 0.05); **P <0.01; 
* P < 0.001; one-sided Wilcoxon rank sum test. c, Box-and-whisker plot 
of the lengths of the 3’ UTRs of mRNAs from the indicated Drosophila 


species that are orthologous to the D. melanogaster mRNAs found in the 
localized and protected categories, as defined previously’’. Incomplete 
annotation did not allow us to perform this analysis for all the species 
shown in Fig. 4i.White dots denote the mean. P values of the statistical test 
(one-sided Wilcoxon test) of whether the lengths of the localized versus 
protected mRNAs are different, are shown for each species. d, e, RNA-seq 
scatterplots from 0-2-h wild-type (yw) and 0-2-h Aub-null (aub) 
embryos. Shown in red are posterior localized mRNAs (d) or the top 100 
mRNAs identified from Aub CLIP piRNA-mRNA chimaeric reads (e). 
There is no change in mRNA levels between wild-type and aub mutant 
0-2-h embryos. f, g, Hatch rates (f) and fertility of progeny (g) of embryos 
from indicated genotypes. Note that, unlike Tudor and Csul, the absence 
of Aub (aub"N?/2C#) leads to complete embryo lethality. h, Gross ovary 
appearance of wild-type (yw), Tudor mutant (tud/1/Df]) and Csul mutant 
(csul®™S°) adult flies. Note complete absence of germline ovarian tissue in 
adult flies lacking Tudor or Csul; embryos from these flies develop into 
agametic adults because PGCs are never specified. 
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Extended Data Table 1 | Overlap of piRNAs from CLIP and immunoprecipitation libraries 


Library 1 Library 2 


Aub_IP_yw_embryo_0-2h Aub_CLIP_yw_embryo_0-2h_H1 
Aub_IP_yw_embryo_0-2h Aub_CLIP_yw_embryo_0-2h_H2 
Aub_IP_yw_embryo_0-2h Aub_CLIP_yw_embryo_0-2h_H3 


Aub_IP_yw_ovary Aub_CLIP_yw_ovary_H1 
Aub_IP_yw_ovary Aub_CLIP_yw_ovary_H3 
Aub_IP_yw_ovary Aub_CLIP_yw_ovary_H2 


Aub_IP_tud_embryo_0-2h Aub _CLIP_tud_embryo_0-2h_L1 
Aub_IP_tud_embryo_0-2h Aub_CLIP_tud_embryo_0-2h_H2 
Aub_IP_tud_embryo_0-2h Aub_CLIP_tud_embryo_0-2h_H3 
Aub_|IP_tud_embryo_0-2h Aub_CLIP_tud_embryo_0-2h_H1 


Aub_|P_yw_ovary Aub_CLIP_yw_ovary_L1 
Aub_IP_yw_ovary Aub_CLIP_yw_ovary_L2 
Aub_IP_yw_ovary Aub_CLIP_yw_ovary_L3 


Aub_IP_yw_embryo_0-2h Aub_CLIP_yw_embryo_0-2h_L1 
Aub_IP_yw_embryo_0-2h Aub_CLIP_yw_embryo_0-2h_L2 


unique piRNA 
sequences in 


library 1 


6913438 
6913438 
6913438 
9938639 
9938639 
9938639 
5147948 
5147948 
5147948 
5147948 
9938639 
9938639 
9938639 
6913438 
6913438 


unique piRNA 
sequences in 


library 2 


348812 
838891 
694458 
560082 
293375 
332484 
1257672 
1104187 
2567880 
1040626 
1850192 
2407082 
3082922 
2012094 
2161685 


common 


150654 

333876 

284532 

286627 

156976 

176012 

458182 

460392 

948630 

379030 

874693 

1108175 
1367516 
722743 

769241 


percent 


2.179147336 
4.829377222 
4.115636822 
2.883966306 
1.579451673 
1.770986953 
8.900284152 
8.943213879 
18.42734231 
7.362739484 
8800933407 
11.15016855 
13.75959022 
10.454 17634 
11.12675054 
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percent2 Benes 
43.19060124 4222806 
39.79968792 
40.97180823 
51.17589924 
53.50694504 
52.93848727 
36.43096133 
4169511143 
36.94214683 
36.4232683 
47.27579624 
46.03810755 
44 35778784 
35.91994211 
35.58524947 


Aub_IP_yw_embryo_0-2h  Aub_CLIP_yw_embryo_0-2h_L3 6913438 2701578 902250 13.0506703 _33.39714789 


Comparisons of piRNA sequences found in CLIP and immunoprecipitation libraries from same tissues. 
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Crystal structure of a substrate-engaged SecY 
protein-translocation channel 


Long Li!*, Eunyong Park!+*, JingJing Ling?, Jessica Ingram’, Hidde Ploegh? & Tom A. Rapoport! 


Hydrophobic signal sequences target secretory polypeptides to a 
protein-conducting channel formed by a heterotrimeric membrane 
protein complex, the prokaryotic SecY or eukaryotic Sec61 complex. 
How signal sequences are recognized is poorly understood, 
particularly because they are diverse in sequence and length. 
Structures of the inactive channel show that the largest subunit, 
SecY or Sec61q, consists of two halves that form an hourglass- 
shaped pore with a constriction in the middle of the membrane 
and a lateral gate that faces lipid'"!°. The cytoplasmic funnel is 
empty, while the extracellular funnel is filled with a plug domain. In 
bacteria, the SecY channel associates with the translating ribosome 
in co-translational translocation, and with the SecA ATPase in 
post-translational translocation!!. How a translocating polypeptide 
inserts into the channel is uncertain, as cryo-electron microscopy 
structures of the active channel have a relatively low resolution 
(~10A) or are of insufficient quality~*. Here we report a crystal 
structure of the active channel, assembled from SecY complex, 
the SecA ATPase, and a segment of a secretory protein fused into 
SecA. The translocating protein segment inserts into the channel 
as a loop, displacing the plug domain. The hydrophobic core of the 
signal sequence forms a helix that sits in a groove outside the lateral 
gate, while the following polypeptide segment intercalates into the 
gate. The carboxy (C)-terminal section of the polypeptide loop is 
located in the channel, surrounded by residues of the pore ring. 
Thus, during translocation, the hydrophobic segments of signal 
sequences, and probably bilayer-spanning domains of nascent 
membrane proteins, exit the lateral gate and dock at a specific site 
that faces the lipid phase. 

To determine the structure of an active SecY channel, we initially 
generated in Escherichia coli a translocation intermediate, consisting of 
SecA, SecY complex, and a short segment of a secretory protein fused to 
a fast-folding green fluorescent protein (GFP) (Extended Data Fig. 1a). 
Although this complex could be purified”, it failed to crystallize. 
We therefore reduced the complexity of the system by fusing a short 
segment of a secretory protein directly into SecA. The segment contains 
the signal sequence of OmpA and a short polypeptide following 
it, and was inserted into the tip of the two-helix finger of SecA 
(SecA-OAIns; Fig. 1a and Extended Data Fig. 1b), because the finger 
was seen to protrude into the cytoplasmic cavity of SecY in a structure 
of SecA/SecY complex lacking a translocation substrate®. Using 
E. coli SecA-OAIns and E. coli SecY complex, the inserted secretory 
protein segment was indeed translocated to the periplasm in E. coli, 
as demonstrated by the formation of a disulfide bridge between 
a cysteine introduced C-terminally of the signal sequence and 
a cysteine placed into the plug domain of SecY (Extended Data 
Fig. 2a). This disulfide bridge formed spontaneously; that is, 
without addition of an exogenous oxidant. The introduction of Gln 
residues into the hydrophobic core of the signal sequence abolished 
disulfide bridge formation (Extended Data Fig. 2b), demonstrating 


that an intact signal sequence is required for translocation of the 
polypeptide segment. Similar results were obtained with Bacillus 
subtilis SecA-OAIns and Geobacillus thermodenitrificans SecYE 
(Extended Data Fig. 2c), a complex of increased thermostability 
that is functional in E. coli. After optimization (Extended Data 
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Figure 1 | Architecture of the active SecY channel. a, A secretory protein 
segment was inserted into the two-helix finger of the SecA ATPase 
(SecA-OAIns). The segment contains a linker (grey), the signal sequence of 
OmpaA, consisting of the N-, H-, and C-regions (in red, black, and purple 
letters, respectively), and a region (in light green) that includes a unique 
cysteine (yellow star). Residues in the signal sequence are numbered 
backwards from the cleavage site. The fused segment inserts into the SecY 
channel in vivo and spontaneously forms a disulfide bridge with a cysteine 
in the plug. This complex was used for structure determination. b, Ribbon 
diagram of the complex, viewed from the side. The numbers refer to TMs 
of SecY. The lines indicate the membrane boundaries. A nanobody was 
used for crystallization. 
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Fig. 2d-f), the construct chosen for crystallization contained 
49 residues inserted into the two-helix finger of B. subtilis SecA, with 
a cysteine at position +7 in the region following the signal sequence 
of 20 residues. Channel insertion of the secretory protein segment was 
similar to that observed with the physiological system, containing wild- 
type SecA and a GFP fusion to a secretory protein fragment (Extended 
Data Fig. 1), except that the latter required an additional polypeptide 
segment to span the SecA molecule. Thus, our simplified system is a 
faithful mimic of normal initiation of protein translocation. Binding of 
SecA to the SecY complex seems to be sufficient to cause polypeptide 
chain insertion into the channel, similar to how ribosome binding 
allows nascent chain insertion in co-translational translocation!’. In 
our system, disulfide crosslinking at the periplasmic side made channel 
insertion irreversible. 

The disulfide-bridged complex of B. subtilis SecA-OAIns and 
G. thermodenitrificans SecYE was purified and crystallized in the 
presence of ADP and BeFx (Extended Data Fig. 3), conditions that 
lock SecA into a conformation close to its ATP-bound state and 
maximize the affinity of SecA for the channel”"“. The diffraction of the 
crystals was improved by the use of single-domain antibody fragments 
(nanobodies), raised against G. thermodenitrificans SecYE and selected 
for binding to periplasmic loops of the SecA-OAIns/SecYE complex, 
and by soaking crystals with a TagBr12 metal ion cluster. The structure 
was determined from multi-wavelength anomalous diffraction (MAD) 
data obtained with a crystal that diffracted to a resolution of 3.70 A along 
one axis and 4.48 A along the other two (Fig. 1b, Extended Data Fig. 4 
and Extended Data Table 1). An initial experimental electron density 
map had a resolution of ~5.5 A. This map was improved by density 
modification and molecular replacement using higher-resolution 
structures of SecA, SecYE, and the nanobody, followed by cycles of 
model building and refinement. Inclusion of the model-refined TagBrj2 
clusters as the resolved heavy atom substructure for recalculation 
of MAD phases did not further improve the map. Nevertheless, the 
final map allowed the unambiguous placement of all transmembrane 
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Figure 2 | Conformational changes of the SecY 
channel. a, Comparison of the lateral gate of the 
active G. thermodenitrificans channel (salmon, 
except for TM7 in cyan) with the closed SecY 
channel of M. jannaschii (in tan). The largest 
changes are indicated by arrows. The translocating 
peptide segment is omitted for clarity. b, As in a, 
but a comparison with that of the SecA-bound 

T. maritima channel lacking a translocating 
polypeptide (pink). c, Comparison of the plugs 
(orange for idle M. jannaschii channel and yellow 
for the active G. thermodenitrificans channel). Pore 
residues are shown as red spheres. d, As in c, but 
comparison with the inactive T. maritima channel 
(plug in magenta). 


: Lateral 
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segments (TMs) of SecY and SecE and of many other regions. The 
translocating polypeptide segment could be built into the map without 
model bias (Extended Data Fig. 4c). As expected, the nanobody bound 
to the periplasmic side of SecY, interacting with both the plug and the 
loop between TM3 and TM4 (Fig. 1b and Extended Data Fig. 5a). 

The structure of the active channel shows that SecA undergoes rel- 
atively small conformational changes compared with a Thermotoga 
maritima complex lacking a translocating chain? (Extended Data Fig. 6). 
SecA binds to the cytoplasmic loop between TM8 and TM9 and 
the C-terminal tail of SecY (Extended Data Fig. 5b, c). It probably 
binds only weakly to the loop between TM6 and TM7, as its tip is 
disordered. In contrast to the ribosome §, SecA also binds to the amino 
(N)-terminal half of SecY; that is, the loop between TM2 and TM3 
(Extended Data Fig. 5b). Thus, at least in its ATP-bound state, SecA 
prevents large relative movements of the two halves of SecY. 

SecY also undergoes relatively small changes, except at the lateral 
gate (Fig. 2). Compared with the idle Methanocaldococcus jannaschii 
or Thermus thermophilus channels’, only TM7 (M. jannaschii) or 
TM7 and TM8 (T: thermophilus) significantly shift their positions 
(Fig. 2a and Extended Data Fig. 7a). Compared with the SecA-bound 
T. maritima channel’, the periplasmic ends of TM3 and TM7 move 
towards each other, and TM7 tilts by 10° relative to the plane of the 
membrane (Fig. 2b), changes that generate a pocket for the signal 
sequence (see below). In both structures, the lateral gate is partly open 
(compare Fig. 2a and Fig. 2b). 

In the active G. thermodenitrificans SecY channel, the plug consists 
of two 8-strands and therefore differs from the a-helical structures 
observed in other species!*> (Extended Data Fig. 8a). Such 
variability is consistent with the fact that the amino-acid sequence 
of the plug region is least conserved!, and that plug deletions cause 
neighbouring polypeptide regions to form new plug domains’>. 
Different plug structures can probably be tolerated, as long as they 
fill the extracellular cavity of the channel, so that the closed state of 
the channel is stabilized and small molecules cannot pass through it. 
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However, it is possible that the plug has different conformations in the 
closed and active channels. 

Whereas the plug is close to the central constriction in the closed 
M. jannaschii channel’, in the active G. thermodenitrificans channel 
it moves to the periplasmic side and towards the back of the channel, 
away from the lateral gate (Fig. 2c). The plug comes close to the TM 
of SecE, consistent with disulfide crosslinking experiments!®!”. Ina 
SecA-SecY structure lacking a translocation substrate®, the plug moves 
a smaller distance and towards the front (Fig. 2d), partly sealing the 
opened lateral gate (Extended Data Fig. 8b). In an intact membrane, 
this would prevent surrounding lipid molecules from moving through 
the lateral gate into the extracellular cavity. The plug is probably flexible 
in the active channel, but in our crystal structure it is confined both by 
the disulfide bond to the translocating chain and by the interaction with 
the nanobody (Extended Data Fig. 5a). Indeed, in a 6.5 A-resolution 
structure determined without nanobody, the plug is shifted further 
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Figure 3 | The polypeptide in the channel. 

a, Cut-away side view of the translocating 
polypeptide. The N-, H-, and C-regions of 

the signal sequence are indicated. b, View 
perpendicular to the axis of the signal 

sequence helix. c, Side view showing the 
translocating polypeptide together with a surface 
representation of the channel, with hydrophilic 
and hydrophobic residues in blue and orange, 
respectively. d, Side view of a cut through a 
surface representation model, with the main 
chain of the translocating chain shown as blue 
sticks. The residues located in the pore ring (red) 
are indicated. The plug is in orange. e, Top view 
of a slab, showing pore residues and Gly19 of 


TM3 the translocating chain as stick and balls with 
Signal a mesh surface representation. Pore residues 
sequence displaced in the active channel are in yellow. 


f, As in e, but with the closed M. jannaschii 
channel. 175 and 1170 correspond to V74 and 
L179 in G. thermodenitrificans, respectively. 


M. jannaschii 


towards the back, although there are otherwise only small differences 
(Extended Data Fig. 9a). 

The signal sequence of the secretory protein segment forms a helix 
that is tilted ~45° relative to the plane of the membrane (Fig. 3a). The 
positively charged N terminus of the signal sequence (N-region) is in 
the same plane as hydrophilic residues of SecY. In an intact membrane, 
the N-region could interact with the negatively charged head groups 
of the phospholipid bilayer. This interaction may retain the N terminus 
on the cytoplasmic side of the membrane while the C-terminal end 
of the signal sequence moves through the channel, resulting in loop 
insertion of the translocating polypeptide. 

The hydrophobic core of the signal sequence (H-region; residues 
—17 to —6 of the original OmpA sequence; Fig. 1a) sits in a groove out- 
side the lateral gate of SecY and forms a helix that runs almost parallel 
to TM2 (Fig. 3a, b). Some residues make van der Waals contacts with 
hydrophobic amino acids in TM2 (Extended Data Fig. 7c), but most 
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Figure 4 | Scheme of SecA-mediated protein translocation. Stage a 
corresponds to the closed channel’, stage b to the structure of the inactive 
SecA-SecY complex’, stage c represents an intermediate to stage d, which 
corresponds to the structure of the active channel. Stage e is attained after 


face detergent (Fig. 3b), and would be in contact with hydrocarbon 
chains of phospholipids in an intact membrane. 

The C-terminal region of the signal sequence (C-region; residues 
—5 to —1; Fig. 1a) replaces the periplasmic end of TM7 in the closed 
M. jannaschii channel (Extended Data Fig. 7b). This segment forms a 
distorted, amphipathic a-helix that is intercalated between TM7 and 
TM3 into the periplasmic side of the lateral gate (Fig. 3a, c). In our 
model, the side chains of Thr(—5) and Ser(—2) point into the peri- 
plasmic cavity previously occupied by the plug in the closed channel 
(Extended Data Fig. 7d). The hydrophobic residues Val(—4) and 
Ala(—3) face detergent/lipid. The lateral gate is thus sealed by the 
C-region from surrounding lipid molecules, which could otherwise 
pass through a large opening generated by the displacement of the plug 
from the front (Fig. 3c). After signal sequence cleavage, the periplasmic 
parts of TM7 and TM3 probably move towards each other and seal 
the lateral gate. 

The hydrophilic polypeptide segment following the signal sequence 
adopts a partly extended conformation with a loop in the periplasmic 
cavity, centred on the cysteine used for crosslinking to the plug. The 
signal sequence cleavage site is located inside the channel (Fig. 3c), 
probably inaccessible to the periplasmically disposed active site of 
signal peptidase!®. Thus, at some point during translocation, the 
C-region of the signal sequence probably must adopt a more extended 
conformation, a change also suggested by experiments with synthetic 
signal peptides!?”°. 

The polypeptide chain inside the channel is perpendicular to the 
plane of the membrane (Fig. 3a). The two strands of the hairpin formed 
by the translocating polypeptide do not interact with one another, so 
that during translocation the C-terminal part of the hairpin could move 
unimpeded through the centre of the channel. Our model places resi- 
due Gly(+19), or one of the neighbouring residues (Ala18 or Gly20), 
of the translocating polypeptide inside the pore ring (Fig. 3d). The 
density around Gly19 is particularly strong, indicating that this seg- 
ment is confined by the surrounding four pore ring residues (Ile78, 
Ile183, Ile275, Ile404). The ring is wider than in the idle M. jannaschii 
or SecA-bound T. maritima channels (diameters 8.8 A versus 5.6 A or 
6.6A respectively). Crystallization may have favoured the presence 
of small amino acids in the pore, minimizing its expansion by the 
presence of a translocating chain. However, even a small increase in 
pore diameter would allow the passage of amino acids with larger side 
chains. The pore ring residues fit snugly around the translocating pol- 
ypeptide (Fig. 3e), confirming that they form a ‘gasket’ that maintains 
the permeability barrier for ions and other small molecules during 
translocation”!. Consistent with disulfide crosslinking experiments”, 
only pore ring residues contact the translocating chain (Fig. 3d). Thus, 
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signal sequence cleavage. The translocating polypeptide is shown in green 
and the signal sequence as a green cylinder. The lateral gate of the channel is 
shown as a broken line on the left. The clamp of SecA is indicated. 


the hourglass-shape of the channel minimizes interactions with the 
translocation substrate, facilitating its movement through the channel. 

The pore ring of the idle M. jannaschii channel contains two addi- 
tional residues (Ile75 and Ile170; Fig. 3f). In the active channel, the 
corresponding residues (Val74, Leu179) are displaced (Fig. 3e). The 
pore ‘ring’ is thus open at the lateral gate between Ile78 in TM2 and 
Ile275 in TM7 (Fig. 3e). These features suggest that a translocating 
polypeptide segment continuously encounters the hydrocarbon chains 
of surrounding lipids; when sufficiently hydrophobic, the segment will 
partition into the lipid phase and become a TM domain of a membrane 


Our crystal structure probably reflects the physiological situation 
of a translocating polypeptide. Five of the seven residues of the 
N-terminal linker are invisible and thus probably flexible, allowing 
unrestricted interaction of the signal sequence with the channel. In 
addition, most polypeptide segments following the signal sequence 
are in a relaxed conformation, unconstrained by fusion to SecA, or 
the disulfide bridge to the plug. The disulfide bridge helps to stabilize 
the signal sequence in the channel, but it probably does not lead to 
gross distortions, because the plug is mobile and the disulfide bridge 
is formed spontaneously in vivo. 

The crystal structure of the active channel leads to a refined model 
for post-translational protein translocation in bacteria (Fig. 4). The 
SecY channel is initially in the idle state, with the plug in the centre and 
the lateral gate closed (Fig. 4a). Binding of SecA primes the channel 
for the arrival of a secretory protein precursor: the lateral gate is partly 
opened, the pore ring widened, and the plug domain moved towards 
the front (Fig. 4b). Next, the secretory protein inserts into the channel 
as a loop, with the C-terminal section of the polypeptide hairpin in 
the pore proper, surrounded by pore ring residues (Fig. 4c, d). During 
subsequent cycles of ATP hydrolysis, SecA uses a ‘push-and-slide’ 
mechanism to move the C-terminal part of the polypeptide loop 
through the pore’, Eventually, the signal sequence is cleaved by signal 
peptidase (Fig. 4e). 

During loop insertion, the H-region of the signal sequence moves 
through the cytoplasmic part of the lateral gate and ends up ina hydro- 
phobic groove on the outside, while the following hydrophilic segment 
crosses the lateral opening of the pore ring. A signal sequence might 
move through a partly open gate in an extended conformation, or it 
could move through a widened gate as a preformed helix. The latter 
possibility is suggested by a ~10 A-resolution electron microscopy 
(EM) structure of a ribosome/nascent chain/channel complex, in 
which a signal sequence helix was seen inside the lateral gate®, probably 
prevented from exit by a disulfide bridge between a cysteine at the 
end of the signal sequence and a cysteine in the plug. The groove on 
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the outside of the lateral gate appears to be a general binding site for 
hydrophobic sequences, as indicated by a 8.5 A resolution structure of 
a complex in which the OmpA signal sequence was replaced by that 
of DsbA (Extended Data Fig. 9b, c), and EM structures of ribosome- 
channel complexes, in which TMs of nascent membrane proteins 
are located at about the same position”** (Extended Data Fig. 7e). 
Like TMs, signal sequences appear to be recognized mostly by lipid 
partitioning, consistent with their ability to be crosslinked to lipids”®7, 
and the correlation between the partitioning of synthetic peptides 
into hydrophobic solvents and their function as signal sequences 
in vivo~®, Nevertheless, amino-acid interactions with TM2 of the channel 
may contribute to the recognition of signal sequences. This may 
explain why some amino acids occur more frequently than others in 
signal sequences, even when they have about the same hydrophobicity: 
leucine is preferred over isoleucine and valine”, perhaps because its 
extended side chain can make tighter van der Waals contacts with 
residues in TM2. Whereas signal sequences of translocating secretory 
proteins would tend to stay in the binding pocket until they are cleaved 
off, the more hydrophobic TMs of membrane proteins could move 
away once the connecting loop to the polypeptide segment inside the 
channel pore attains adequate length. 

While this paper was under review, a cryo-EM structure was pub- 
lished describing an active ribosome-bound mammalian Sec61 channel 
containing a short secretory polypeptide segment*”. The authors con- 
clude that the signal sequence helix replaces TM2 of Sec61a in the 
idle channel, implying that the signal sequence is intercalated into the 
lateral gate and raising the possibility that the conformational changes 
differ greatly from those in our system. However, a superposition of 
the two active channels based on secondary structure matching shows 
that they are actually very similar, with only moderate differences 
at the periplasmic/luminal ends of some TMs (Extended Data Fig. 7f). 
Importantly, in both cases the signal sequence helix docks to the same 
site outside the lateral gate and runs parallel to TM2 (Extended Data 
Fig. 7f). Thus, regardless of the organism and mode of translocation, 
lipid partitioning appears to be the major mechanism by which signal 
sequences are recognized. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Protein translocation and disulfide crosslinking assays. To test in vivo OmpA- 
GFP translocation by E. coli SecA and E. coli SecY complex (three-component 
system), E. coli strain EP52 or EP62 (Armf AompT secY-CBP)”” was transformed 
with pACYC-SecYEG expressing E. coli SecYEG complex with SecY containing 
a unique Cys at position 68 (ref. 21). The cells also expressed OmpA-GFP from 
pBAD-OmpA-GFP under an arabinose-inducible promoter”. In all constructs, 
including those used in the two-component system, position —1 of the signal 
sequence was mutated to Tyr to prevent signal peptide cleavage*!. The cells were 
grown at 37°C to log phase in lysogeny broth (LB) medium supplemented with 
100g ml! ampicillin and 401g ml“! chloramphenicol. OmpA-GFP expression 
was induced by addition of 0.15% L-arabinose for 1 h. Where indicated, 0.3 mM 
copper(II) 1,10-phenanthroline (CuPh;) was added to the bacterial culture for 
15min at room temperature (22°C). The culture was treated with 10 mM N-ethyl 
maleimide (NEM) for 30 min on ice to block free cysteines. The cells were lysed 
in SDS sample buffer. Samples (equivalent amounts based on an absorbance at 
600nm, Agoonm Of E. coli cultures) were subjected to non-reducing SDS-PAGE and 
analysed by western blotting with anti-SecY- or GFP-antibodies. Where indicated, 
the samples were treated with 2% 3-mercaptoethanol (3-ME) at 50°C for 20 min 
before SDS-PAGE. To test the translocation of E. coli SecA-OAIns fusion with 
E. coli SecY complex (two-component system), EP52 cells were transformed 
with pACYC-SecYEG and pBAD-EcSecA-OAIns. The cells were grown to log 
phage in LB medium supplemented with 100,1g ml“! ampicillin and 40;.gml"! 
chloramphenicol, and SecA-OAIns expression was induced by addition of 0.1% 
L-arabinose for 4h at room temperature or 30 min at 37 °C. Disulfide crosslinking 
and non-reducing SDS-PAGE analysis were performed as described above. 
Translocation by B. subtilis SecA and G. thermodenitrificans SecY complex was 
tested similarly. In the case of the three-component system, E. coli strain EP51 
(Armf AompT)”! was transformed with pTet-G. thermodenitrificans SecEyiss/ 
Yuiso. Which expresses bicistronic G. thermodenitrificans secE and secY genes 
(SecY contains a Cys at position 60) under a tetracycline-inducible promoter”?. 
The cells were also transformed with pBAD-OmpA-GFP/B. subtilis SecA, a mod- 
ified version of pPBAD-OmpA, which contains an additional ribosome binding site 
and the B. subtilis secA gene (for bicitronic expression) following OmpA-GFP. In 
the case of the two-component system, EP51 cells were transformed with pTet- 
G. thermodenitrificans SecEyjisg/Y iso and pBAD-B. subtilis SecA-OAIns. The cells 
were grown to log phase at 37°C, and the expression of G. thermodenitrificans 
SecYE was induced by addition of 200ngml“! anhydrotetracycline for 1.5-2h at 
37°C. Then, the expression of OmpA-GFP/BsSecA or BsSecA-OAIns was induced 
by addition of 0.2% L-arabinose for 1-3 h. The cells were collected and subjected to 
SDS-PAGE followed by western blotting as described above. G. thermodenitrificans 
SecY was detected using anti-His antibodies. 
Nanobody library generation. Purified G. thermodenitrificans SecYE in 
n-dodecyl B-p-maltoside (DDM)-containing buffer was injected into an alpaca to 
elicit an immune response. A male alpaca (Vicugna pacos) was purchased locally, 
maintained in pasture, and immunized following a protocol authorized by the 
Tufts University Cummings Veterinary School Institutional Animal Care and 
Use Committee. After five rounds of immunization, total RNA was isolated from 
~10° fresh peripheral blood lymphocytes, using an RNeasy Plus Mini Kit (Qiagen), 
following the manufacturer’s instructions. Total RNA was used to synthesize a 
complementary DNA (cDNA) library using SuperScript III reverse transcriptase 
(ThermoFisher Scientific) with a combination of random hexamers, oligo(dT), and 
gene-specific primers. The variable fragments of heavy chain antibodies (VHHs) 
segments were further amplified from this cDNA library using primers specific to 
the VHH region*’. PCR products were pooled, digested with NotI-HF and Ascl 
(NEB), gel purified, ligated into a M13 phagemid vector (pJSC), and transformed 
via electroporation into TG1 E. coli (Agilent). Library complexity was assessed by 
serial dilution and plating on 2YT agar plates supplemented with 2% glucose and 
10g ml"! ampicillin. 
Selection of nanobodies by phage display. Purified SecYE and SecA-OAIns/ 
SecYE proteins were biotinylated via coupling to primary amines with a fivefold 
molar excess of Chromalink NHS biotin reagent (Solulink) for 90 min. Excess bio- 
tin reagent was removed using a ZeBa desalting column (ThermoFisher Scientific). 
Twenty micrograms of each protein were mixed with 10011 of MyOne Streptavidin 
T1 Dynabeads (ThermoFisher Scientific) blocked with 2% BSA. The beads were 
incubated with 200 il of phage at 101? plaque-forming units per millilitre for 1h 
at room temperature. Non-binding phage was washed away and bound phage 
was eluted first by incubating with 1 ml of saturated ER2738 culture, followed by 
200mM pH 2.2 glycine. The elutions were neutralized, pooled, and plated onto 2YT 
agar plates supplemented with 2% glucose, 5j.gml! tetracycline, and 10 jg ml! 
ampicillin. A second round panning was performed with 21g of each protein 
and 40,1 MyOne Streptavidin T1 Dynabeads. All procedures were conducted in 
20mM Tris-HCl pH 7.5, 150mM NaCl, 10% glycerol, 0.02% DDM for SecYE, 


and with 20 mM HEPES-KOH pH 7.0, 150mM NaCl, 10% glycerol, 0.02% DDM, 
5mM MgCh, 0.1mM ADP/BeFx for SecA-OAIns/SecYE. For each protein, 
95 clones were sequenced, and sequences that appeared more than 5 times were 
selected for subsequent validation. 

Nanobody screening. Thirteen distinct families of nanobodies directed against 
G. thermodenitrificans SecYE were identified by DNA sequencing. Twenty-two 
nanobody clones were sub-cloned into the pHEN6 vector**, which adds an 
N-terminal pelB sequence and C-terminal sortase and His¢ tags (LPETGG-Hiss). 
The proteins were expressed in 5 ml E. coli cultures. After Ni-resin purification, the 
nanobodies were labelled with Alexa 555 (Invitrogen) by sortase reaction*!. The 
labelled nanobodies (1 |1M concentration) were mixed with G. thermodenitrificans 
SecA-OAIns/SecYE complex or with SecYE alone at a molar ratio of 2:1. Nanobody 
binding was monitored by a shift of the peak in size-exclusion chromatography 
coupled with a fluorescence detector (Shimadzu). All 22 nanobodies bound to 
SecYE, but only nanobody AYC08 had a high affinity for SecA-OAIns/SecYE. The 
binding of AYC08 to free SecYE was weaker than to SecA-OAIns/SecYE, indicating 
that AYC08 interacts with the periplasmic side of SecY and that the binding epitope 
is only fully exposed in the active channel. 

Protein expression and purification. E. coli strain EP51 was transformed with 
plet-G. thermodenitrificans SecEyjisg/Y and pBAD-B. subtilis SecA-OAIns49(L7). 
Residues 202-213 in the loop between TM5 and TM6 of SecY were replaced by 
the sequence TFGGLN. Cells were grown in LB medium supplemented with 
100,j1g ml! ampicillin, 40 ,gml~‘ chloramphenicol, and 0.5% glycerol at 37°C 
until Agoonm reached 0.6-0.7. The expression of the G. thermodenitrificans SecYE 
was induced by addition of 200 ng ml! anhydrotetracycline, and cells were incu- 
bated for 1.5 h at 37°C and an additional 1 h at 22°C. Then 0.15% L-arabinose was 
added to the culture to express B. subtilis SecA-OAIns overnight at 16°C. Cells were 
harvested by centrifugation and stored at —80°C until use. 

The cells were suspended in buffer A (20mM Tris-HCl pH 7.5, 150 mM 
NaCl, 1mM phenylmethanesulfonyl fluoride) and lysed in a microfluidizer 
(Microfluidics). The membranes were pelleted by ultracentrifugation, washed 
once with buffer B (20mM Tris-HCl pH 7.5, 300mM NaCl), and solubi- 
lized with 1% DDM (Anatrace) in buffer C (20 mM Tris-HCl pH 7.5, 150 mM 
NaCl, 10% glycerol). After 1 h incubation at 4°C, the solution was clarified by 
ultracentrifugation. The supernatant was loaded onto a 5 ml POROS-MC20 
column (Applied Biosystems) pre-charged with CoCl). After washing with 15 ml 
of buffer D (as buffer C, but with 0.02% DDM) containing 10 mM imidazole and 
5 ml of buffer D containing 15 mM imidazole, the protein was eluted with 5.5 ml 
of buffer D containing 250 mM imidazole. Immediately after elution, 1 mM ADP/ 
BeFx, 5mM MgCh, and 0.1 mg ml! E.coli polar lipids (25 mg ml stock dissolved 
in 1% DDM) were added. To cleave the GFP-strep tag, 3C protease was added at 
a ratio of 1:30 (w:w) and the mixture was incubated overnight at 4°C. The sample 
was diluted 1:1 (v:v) with buffer E (20 mM Tris-HCl pH 7.5, 10% glycerol, 0.02% 
DDM, 5mM MgCl,) and loaded onto a Mono Q 10/100 column (GE Healthcare). 
The protein was eluted with a gradient of 15-35% buffer F (20mM Tris-HCl 
pH7.5, 1 M NaCl, 10% glycerol, 0.02% DDM, 5mM MgCl). The peak fractions 
were collected, and 0.1mg ml! E.coli polar lipids and 1 mM ADP/BeFx were 
added. The protein was concentrated with an Amicon filter (100kDa MWCO, 
EMD Millipore) and loaded onto a Superdex 200 10/300 column (GE Healthcare) 
in buffer G (20 mM HEPES-KOH pH 7.0, 100 mM NaCl, 10% glycerol, 0.02% 
DDM, 5mM MgCh, 1mM ADP/BeFx). The peak fractions were concentrated 
to ~12mg ml, aliquoted, and flash-frozen in liquid nitrogen. The protein was 
stored at —80°C and thawed right before crystallization. 

The plasmid coding for nanobody AYC08 was transformed into WK6 cells. The 
cells were grown in 2xYT medium at 37°C and protein expression was induced 
with 0.5 mM isopropyl B-p-1-thiogalactopyranoside (IPTG) at A¢oonm = 0.6. The 
incubation was continued overnight at 30°C. About 5 g of cells were obtained 
from 1 L of culture. After suspension in 15 ml of TES buffer (200mM Tris-HCl pH 
8.0, 500 4M EDTA, 500 mM sucrose), the sample was diluted 1:3 (v:v) in ice-cold 
water for 3 h to induce cell lysis. After centrifugation, the supernatant was mixed 
with 5 ml of Ni-NTA resin (Qiagen) and incubated at 4°C for 1 h. The protein was 
eluted with 250 mM imidazole. The protein was further purified by gel filtration on 
a Superdex 200 10/300 column in 20 mM HEPES-KOH pH 7.0, 100mM NaCl, 10% 
glycerol, 5mM MgCl). The purified nanobody was concentrated to 30 mg ml |. 
Crystallization. Initial crystals were obtained with a complex of 
G. thermodenitrificans SecYE and either B. subtilis SecA-OAIns and or B. subtilis 
SecA-DsbAlIns (the last containing the signal sequence of DsbA instead of that of 
OmpA). Only small crystals were obtained and diffracted to a maximum resolu- 
tion of 6.5 A at the synchrotron X-ray source. A variety of approaches were tested 
to improve the crystals, such as inserting different hydrophilic proteins into each 
of the periplasmic loops of SecY (T4 lysozyme*, cytochrome b-562 (ref. 36), 
P1/P4 domain of SecD/F*’, ROP helical bundle**), truncating SecY loops, 
using various detergents, co-expressing SecG, and employing Fab-fragments 
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of monoclonal antibodies generated against SecY. However, crystals with 
improved diffraction were only obtained when the complex was co-crystallized 
with nanobody AYC08. 

The complex of G. thermodenitrificans SecYE and B. subtilis SecA-OAIns was 
mixed with nanobody AYC08 at a molar ratio of 1:1.2 with addition of 1!mg ml"! 
lipids (42 mg ml! 1,2-dipalmitoyl-sn-glycero-3-phosphoglycerol (DPPG) plus 
1,2-dipalmitoyl-sn-glycero-3-phosphoethanolamine (DPPE) (3:1) suspension in 
0.5% DDM)*”. The mixture was incubated at 4°C overnight and clarified by ultra- 
centrifugation before setting up crystallization trays. The initial crystal screening 
yielded several crystal forms. Three of them were readily reproducible, but all dif- 
fracted to a maximum of ~6 A resolution. Heavy atom compounds were screened 
for crystal soaking and the TagBrj, cluster improved the resolution limit of one crys- 
tal form. The best crystals were obtained with the hanging drop method, mixing 
0.5 tl of the protein solution and 0.5 11 of well solution (21-24% polyethylene glycol 
(PEG) 1500, 100 mM Tris-HCl pH 8.5, 50-100 mM MgAcp, 2% 2-methyl-2,4- 
pentandiol; MPD) and using 24-well VDX plates with 500 jl well solution. The 
crystals were grown at 22°C over a week. The TasBrj2 powder (Jena Bioscience) was 
suspended in buffer at a concentration of 20mM and added to the crystallization 
drops at a final concentration of ~2 mM. After overnight incubation, the crystals 
were flash-frozen in liquid nitrogen. 

X-ray data collection and structure determination. Hundreds of crystals were 
screened at NE-CAT and GM/CA-CAT of the Advanced Photon Source (Argonne 
National Laboratory). The diffraction of the crystals decayed rapidly and was 
weak, caused by strong heavy atom absorption. However, a complete set of three- 
wavelength MAD data from a single crystal could be collected at GM/CA-CAT. The 
data were processed with the XDS package” and analysed by the program Aimless"!. 
The crystal belongs to the P6,22 space group. The diffraction was anisotropic. 
Along axis c*, the diffraction went beyond 3.70 A (I/sigma=2.0), whereas along 
the axes a* and b*, the diffraction was limited to 4.48 A (I/sigma = 1.5). The 
data were initially processed to 3.9 A. Anisotropy correction was applied in the 
different programs used in the following calculations. Molecular replacement 
was used to locate SecA, SecYE, and the nanobody, employing as search models 
B. subtilis SecA (PDB accession number 1TF5), T. maritima SecYE (PDB accession 
number 3DIN), and an anti-DHFR nanobody (PDB accession number 4EIG), 
respectively. The crystal contained one complex per asymmetric unit with a sol- 
vent content of 69%. Heavy atom sites were identified with the molecular replace- 
ment results by MR-SAD in Phaser” and refined without molecular replacement 
models in Sharp (Global Phasing). The experimental map based on the positions 
of the heavy metal ion clusters contained useful phase information to 5.5 A (judged 
by phasing power of 1). The overall figure of merit was 0.44 (acentric) and 0.42 
(centric). The phases were extended to 3.9 A through density modification, using the 
programs Resolve* and CNS™. After density modification, most of the a-helices 
were well resolved and some of the large side chains were visible in B-factor-sharpened 
maps. To improve the density map, model phases were combined with experimen- 
tal phases. Models for SecA and the nanobody molecules were placed first and 
modified according to the density map. The SecY and SecE molecules were then 
built with the guidance of B-factor-sharpened maps and the crystal structures of 
T. thermophilus* and T. maritima SecYE?. A density map for tracing the translocating 
peptide was generated by combining phase information from MAD phasing, 
and models of SecA, nanobody, and SecYE (Extended Data Fig. 4c). The signal 
sequence was initially modelled as an ideal poly-Ala helix and placed into the den- 
sity map. The registry of the signal sequence helix was then determined from the 
density for two aromatic residues (Phe(—7) and Tyr(—1)). The following 23 amino 
acids of the translocating peptide were traced in a B-factor-sharpened density map. 
The registry of this segment was determined on the basis of density for an aromatic 
residue (Phe3), several negatively charged residues (Glu4, Asp5, Glu10, and Glu12) 
surrounding the positively charged TagBrj2 cluster, and the cysteine engaged in 
the disulfide bond (Cys7). Our model places Gly19 inside the pore ring, but it 
is possible that the registry is off by one residue (Ala18 or Gly20 would be in the 
pore), an ambiguity that does not alter the interpretation of the model. Annealing 
to a temperature of 2,500 K was applied to the model at an early stage of refinement 
with the program Phenix*®. The individual XYZ and group B factors were refined 
by using both the Phenix and Refmac5 (ref. 46) programs. Secondary structure 
was tightly constrained during refinement. The single-wavelength anomalous dif- 
fraction (SAD) likelihood function implemented in Refmac5 was used to refine 
the heavy atom clusters together with the protein model. The model was improved 
by molecular replacement and multiple cycles of model building and refinement. 
Manual adjustments were made in COOT”’ and refinement was performed in 
reciprocal space. At a late stage of refinement, the diffraction data were extended 
to 3.7A, which helped to resolve some regions in the density map, for example 
of the translocating peptide. We also tested the data with ellipsoidal truncation 
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processed by the Anisotropy server (http://services.mbi.ucla.edu/anisoscale/). 
The truncated data produced better density maps, which were used for refinement 
as well. The final model was refined to an Ryork Value of 30.6% and an Ree value 
of 32.5% and showed good geometry (Extended Data Table 1). An attempt 
was made to include the model-refined TagBrj2 clusters as the resolved heavy 
atom substructure for recalculation of SAD and MAD experimental phases. 
However, this did not improve the map, in part because individual metal atoms 
of the clusters could not be accurately positioned, as observed in other cases of 
similar resolution*®. While the centres of the clusters are well defined, the individual 
metal atom positions in our model should be considered to be very approximate. 
To ensure that the R-factor is not dominated by the heavy metal ion clusters, we 
tested its sensitivity to changes in the protein model. Rfree increased by 0.013-0.017 
upon deletion of any of the TMs of SecY or of the signal sequence helix. A similar 
increase was observed when the same analysis was performed for three different 
membrane proteins of similar size (2ZD9, 4CZB, 4CDI; 1,078-1,655 amino acids), 
the structure of which was determined in the absence of heavy metal ion clusters at 
similar resolution (3.5-4 A), solvent content (0.63-0.85), and R-factor (0.3-0.34). 
Model validation was performed by using PHENIX. The following regions could 
not be traced: residues 1-15, 244-247, 262-264, 271-272, 620-626, and 635-712 
of SecA; residues 1-12, 145-146, 200-213, 244-260, and 291-308, 390-391, and 
396-398 of SecY; residues 1-3, 22-23, 58-60 of SecE; residues 30-31, 42, and 
100 of the nanobody; and the first five residues of the linker preceding the signal 
sequence. In addition, some amino-acid side chains were not well resolved, so they 
were modelled as Ala. Some density close to metal ion cluster sites 15-18 remains 
unexplained. Figures showing the structures were generated with Chimera®. All 
the X-ray crystallographic software was maintained by SBGrid®”. 

Accession numbers of structures used in this paper. The accession numbers 
used are PDB 1RH5, 3MP7, 2ZJS, 3DIN, 5AWW, 3JC2, 4CG6. The new structure 
described in this paper has accession number 5EUL. 
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Extended Data Figure 1 | Generation of translocation intermediates 
with a three-component system. a, Strategy to generate SecA- 
dependent translocation intermediates in E. coli cells. The intermediates 
are assembled from E. coli SecA, E. coli SecY complex, and substrate 
containing an N-terminal OmpA signal sequence and C-terminal 
superfolder GFP*! (sfGFP). After loop insertion into the SecY channel, 
translocation of the C terminus is stalled by the folded sfGFP. Insertion 
is monitored by disulfide bond formation between a pair of cysteines 
introduced into the substrate and the plug of SecY (yellow stars). 

b, Scheme showing the simplified system, in which a secretory protein 
segment is fused into the two-helix finger of SecA. c, Sequence of the 
substrate used in a. The —1 position of the original signal sequence was 
changed to Tyr to prevent signal sequence cleavage. The position of the 
cysteine and the length of the translocated segment were varied (here 
shown for Cys at position +3 and 58 amino acids in length). d, Variation 


OmpA-GFP 
x SecY 


SecY 


<= OMpA-GFP grep X SECY His 


WB: Anti-His 


OmpA-GFPxtrep X SECY pis, 


OmpA-GFP strep 
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of the Cys position with a translocationg segment of 58 residues. 

Where indicated, disulfide crosslinks to SecY with a Cys at position 68 
(OmpA-GFPxSecY) were induced by the oxidant copper phenanthroline 
(CuPh) before harvesting the cells. The samples were analysed by non- 
reducing SDS-PAGE, followed by western blotting (WB) with anti-SecY 
and anti-GFP antibodies. e, As in d, but in the absence of oxidant, with 
Cys at different positions and variation of the length of the translocated 
segment. Asterisks indicate non-specific bands. f, As in e, but with E. coli 
cells expressing B. subtilis (B.s.) SecA and G. thermodenitrificans (G.t.) 
SecYE. The substrate contained a Cys at position +7, and SecY a Cys at 
position 60. The red arrows indicate spontaneously generated disulfide 
crosslinks (GFP sometimes does not unfold in SDS, resulting in two 
bands). The OmpA-GFP constructs contained a C-terminal Strep-tag 
that was detected by StrepTactin conjugated with horseradish peroxidase 
(HRP). 
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Extended Data Figure 2 | See next page for caption. 
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Extended Data Figure 2 | Generation of translocation complexes with 
SecA-substrate fusion constructs. a, Translocation complexes were 
generated as indicated in the scheme in Extended Data Fig. 1b. An E. coli 
SecA-substrate fusion (SecA-OAIns74 (L12)) was overexpressed together 
with E. coli SecY complex in E. coli cells. SecA-OAIns74 (L12) contains 
74 amino acids inserted into the two-helix finger of SecA, including a 
linker of 12 residues, and a GFP tag following SecA. Translocation of the 
substrate segment was monitored by spontaneous disulfide crosslinking 
between a cysteine at position +7 (with respect to the original signal 
peptidase cleavage site) and a cysteine at position 68 in the plug of SecY. 
Where indicated, }-mercaptoethanol (3-ME) was added to reduce the 
disulfide bond. The samples were analysed by non-reducing SDS-PAGE 
and western blotting with anti-SecY antibodies. The overexpression of 
SecA-OAIns was monitored by the fluorescence of GFP (data not shown) 
and staining with Coomassie blue (CBB, lower panel). b, As in a, but with 
E. coli SecA-OAIns constructs containing from 6 to 12 residues in the 


linker (L6-L12) or mutations (3Q) in the H-region of the signal sequence. 


Expression of SecA-OAIns was verified by the strong green fluorescence of 
cell lysates, caused by GFP fused to the C terminus of SecA (not shown). 
The lower panel shows the sequences of the SecA-inserted segments. c, As 
ina, but with B. subtilis SecA-OAIns74 (L12) and G. thermodenitrificans 
SecYE. SecY and SecA were detected by western blotting with anti-His 
antibodies and Ponceau staining, respectively. d, As in c, but with 

B. subtilis SecA-OAIns containing different inserted segments. SecA- 
OAIns was expressed under different conditions, as indicated. Expression 
of SecA-OAIns was verified by green fluorescence of GFP fused to the C 
terminus of SecA (not shown) and Ponceau staining (second panel). The 
sequences of the constructs are shown in the lowest panel. e, As in d, but 
with different constructs, the sequences of which are shown in the lower 
panel. Where indicated, copper phenanthroline (CuPh) was added to the 
cells to induce disulfide bridge formation. SecA-OAIns49(L7) was used for 
crystallization. f, As in e, but with a Cys at position —1 (the last residue of 
the OmpA signal sequence) instead of position +-7. Note that in this case 
disulfide formation does not occur spontaneously. 
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Extended Data Figure 3 | Purification of an active translocation purification procedure and of fractions indicated with red numbers 
complex. a, Scheme of the purification protocol. b, Elution of the in b. Lane 1, molecular mass markers. Lane 2, sample analysed after 
G. thermodenitrificans SecYE/B. subtilis SecA-OAIns complex from immobilized metal ion affinity chromatography. Lane 3, sample 
a Superdex200 column during the last chromatography step. after cleavage of the GFP tag. Lane 4, sample after anion exchange 
c, Non-reducing SDS-PAGE analysis of samples taken during the chromatography (MonoQ). 
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Extended Data Figure 4 | Electron density map and refined model. for the translocating peptide. Right: map calculated with the model. Phe 


a, Stereo view of the unsharpened density map (2F, - F.; 1c) of the entire (—7) is one of the residues used to determine the registry of the helix. 
complex. Heavy metal ion clusters are shown in yellow. b, As ina, but with — d, A side view of the density for TM3 and TM4. e, Density showing the 

the density map derived from MAD phasing after density modification. disulfide crosslink between the plug and translocating chain. f, Top view of 
c, SigmaA-weighted phase-combined 2F, - F, density maps of the Gly19 of the translocating chain surrounded by pore residues. 


translocating peptide region. Left: omit map calculated without a model 
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Extended Data Figure 5 | Interactions of the nanobody and SecA with with the loop between TM2 and TM3 (L2/3; in red). The loop between 
SecY. a, The nanobody binds to the plug and to the loop between TM3 and = TM6 and TM7 of SecY (L6/7; in red) does not seem to make strong contact 
TM4 (L3/4). b, The polypeptide crosslinking domain (PPXD; in yellow) with SecA. c, Two helices of the HSD interact with the C-terminal tail of 
of SecA interacts with the loop between TM8 and TM9 of SecY (L8/9; SecY (C-tail; in red). 

in red), and the long helix of the helical scaffold domain (HSD; in blue) 
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Extended Data Figure 6 | Comparison of the conformations of domain (HSD), brown; helical wing domain (HWD), grey; polypeptide 
SecA in the active G. thermodenitrificans and inactive T. maritima crosslinking domain (PPXD), yellow). SecA in the T. maritima complex is 
complexes. The domains of SecA in the G. thermodenitrificans complex shown in pink. Left: a top view (the channel would be underneath); right: 


are labelled with different colours (nucleotide binding domain 1 (NBD1), a side view with the two-helix finger (part of helical scaffold domain) 
blue; nucleotide binding domain 2 (NBD2), cyan; helical scaffold indicated. 
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a G. thermodenitrificans b G. thermodenitrificans 
vs T. thermophilus (idle) vs M. jannaschii (idle) 


Extended Data Figure 7 | Localization of signal sequences in the the amphipathic helix are indicated. e, The active G. thermodenitrificans 
G. thermodenitrificans SecY and mammalian Sec61 channels and of a channel (in salmon) was aligned with a mammalian channel (in grey) 
TM domain in the mammalian Sec61 channel. a, In the active channel containing a nascent membrane protein (PDB accession number 4CG6) 
(salmon), the signal sequence displaces TM7 and TM68 in the idle using secondary structure matching’. The signal sequence in the 

T. thermophilus channel (cyan). b, As in a, but comparison with the idle bacterial channel is shown in green, and the TM segment of the nascent 
M. jannaschii channel (tan). The C-region of the signal sequence takes membrane protein in yellow. f, As in e, but comparison of the active 

the position of TM7. c, Side view of the interactions of the H-region of G. thermodenitrificans channel with a mammalian Sec61 channel (light 
the signal sequence with TM2 of G. thermodenitrificans SecY. Interacting blue) containing a secretory protein fragment (PDB accession number 
amino acids are indicated. d, Stereo view showing the intercalation of 3JC2). The signal sequences are shown in green and yellow, respectively. 


the C-region into the periplasmic side of the lateral gate. Residues of 
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a G. thermodenitrificans 7 fannaschii P. furiosus | 


T. thermophilus T. maritima 
2ZJS 3DIN 


Hydrophilic Hydrophobic 
Extended Data Figure 8 | The plug domains in SecY channels. a, The omitted. b, In the inactive T: maritima SecY channel, the plug (in yellow) is 
plugs in channels of different organisms have different structures. Shown at the front of the channel, partly sealing the periplasmic side of the lateral 
are side views with the plugs in yellow and pore residues as red spheres. gate. Shown is a side view in a surface representation, with hydrophilic and 
PDB accession numbers are given below the names of the organisms. For hydrophobic residues in blue and orange, respectively. 


the G. thermodenitrificans channel, the translocating peptide segment was 
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Extended Data Figure 9 | Structures of the active G. thermodenitrificans 
complex determined without nanobody or with a different signal 
sequence. a, Stereo view of density maps at 6.5 A resolution for the active 
complex in the absence of nanobody. Shown is a 2F, - F, density map at 

1a (blue mesh) and a difference map (F, - F,) at 3o (green mesh), both 
calculated by molecular replacement with a model lacking the plug. Strong 
positive density is seen close to SecE, probably corresponding to parts of 
the plug. The arrow indicates the movement of the plug from the position 
in the structure with nanobody to the density seen in the structure without 


nanobody. b, Stereo views of density maps at ~8.5 A resolution for the 
active G. thermodenitrificans complex in which the OmpA signal sequence 
was replaced by that of DsbA. Shown is a side view of the 2F, — F. density 
map at lo (blue mesh) and a difference map (F,- F.) at 30 (green mesh), 
both calculated by molecular replacement with a model lacking the signal 
sequence. Note that the model for the OmpA signal sequence fits well into 
the density corresponding to the DsbA signal sequence. c, As in b, but top 
view and not in stereo. 
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Extended Data Table 1 | Data collection and refinement statistics for MAD structures 
SecA-OAIns/SecYE + 


TagBrj> 

Data collection GM/CAT-CAT @APS 
Space group P6,22 
Cell dimensions 

a, b,c (A) 127.798,127.798,554.772 

a, By (°) 90, 90, 120 

Remote Inflection Peak 

Wavelength 1.2782 1.2552 1.2548 
Resolution (A) 3.70 (3.80-3.70) 4.00 (4.10-4.00) 4.50 (4.62-4.50) 
Ryym OF Rmerge 0.084 (>1) 0.095 (>1) 0.128 (1) 
I/ol 9.30 (0.4) 9.69 (0.9) 11.90 (1.0) 
Completeness (%) 99 (100) 99 (100) 99 (100) 
CC1/2 0.999 (0.57) 0.998 (0.80) 0.999 (0.92) 
Redundancy 10.6 (10.2) 10.6 (10.4) 21.3 (21.6) 
Refinement 
Resolution (A) 3.7 
No. reflections 53812 
Ryorks Riree 0.306/0.325 
No. atoms 9888 

Protein 9532 

Ligand/ion 356 

Water N/A 
B-factors 194.00 

Protein 190.24 

Ligand/ion 300.34 

Water N/A 
R.m.s deviations 

Bond lengths (A) 0.007 

Bond angles (°) 1.15 
Ramachandran 
Favored / allowed / 91.3/6.8/ 
outliers (%) 1.9 


*Highest resolution shell is shown in parenthesis. 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature16135 


Corrigendum: Domains of 
genome-wide gene expression 
dysregulation in Down’s syndrome 


Audrey Letourneau, Federico A. Santoni, Ximena Bonilla, 
M. Reza Sailani, David Gonzalez, Jop Kind, Claire Chevalier, 
Robert Thurman, Richard S. Sandstrom, Youssef Hibaoui, 
Marco Garieri, Konstantin Popadin, Emilie Falconnet, 
Maryline Gagnebin, Corinne Gehrig, Anne Vannier, 

Michel Guipponi, Laurent Farinelli, Daniel Robyr, 

Eugenia Migliavacca, Christelle Borel, Samuel Deutsch, 
Anis Feki, John A. Stamatoyannopoulos, Yann Herault, 

Bas van Steensel, Roderic Guigo & Stylianos E. Antonarakis 


Nature 508, 345-350 (2014); doi:10.1038/nature13200 


Owing to a labelling error in the input files, one of the two replicate 
data sets used for Fig. 5d and e and Supplementary Fig. 6d of this 
Article was incorrect. We have now repeated the analysis with a correct, 
independent replicate experiment. This confirms our previous conclu- 
sion that there are no detectable differences in nuclear lamina inter- 
actions between the normal and trisomy 21 twin cells. Therefore, our 
conclusions remain unaffected. Figure 1 of this Corrigendum shows 
the corrected panels d and e of Fig. 5. The Supplementary Information 
to this Corrigendum shows the corrected panel d of Supplementary 
Fig. 6. The correct data are in the Gene Expression Omnibus under 
accession number GSE55289. 
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Figure 1 | This shows the corrected panels d and e of the original Fig. 5. 


Supplementary Information is available in the online version of the 
Corrigendum. 


400 | NATURE | VOL 531 | 17 MARCH 2016 
© 2016 Macmillan Publishers Limited. All rights reserved 


CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature16470 


Corrigendum: Hallmarks of 
pluripotency 


Alejandro De Los Angeles, Francesco Ferrari, Ruibin Xi, 
Yuko Fujiwara, Nissim Benvenisty, Hongkui Deng, 

Konrad Hochedlinger, Rudolf Jaenisch, Soohyun Lee, 
Harry G. Leitch, M. William Lensch, Ernesto Lujan, 
Duanqing Pei, Janet Rossant, Marius Wernig, Peter J. Park & 
George Q. Daley 


Nature 525, 469-478 (2015); doi:10.1038/nature15515 


In this Review, a sentence was added at proof stages and we inadvert- 
ently omitted a citation to a study from the laboratory of Jacob Hanna’. 
This reference citation should have appeared associated with the sen- 
tence: “The observation that naive cells tolerate depletion of epigenetic 
regulators supports the concept of naive pluripotency as a configura- 
tion with a reduced requirement for epigenetic repression compared 
to primed PS cells and somatic cells!” 


1. Geula, S. et al. mMS&A mRNA methylation facilitates resolution of naive 
pluripotency toward differentiation. Science 347, 1002-1006 (2015). 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature16479 


Corrigendum: Failure to replicate 
the STAP cell phenomenon 


Alejandro De Los Angeles, Francesco Ferrari, Yuko Fujiwara, 
Ronald Mathieu, Soohyun Lee, Semin Lee, Ho-Chou Tu, 
Samantha Ross, Stephanie Chou, Minh Nguyen, 

Zhaoting Wu, Thorold W. Theunissen, Benjamin E. Powell, 
Sumeth Imsoonthornruksa, Jiekai Chen, Marti Borkent, 
Vladislav Krupalnik, Ernesto Lujan, Marius Wernig, 

Jacob H. Hanna, Konrad Hochedlinger, Duanqing Pei, 

Rudolf Jaenisch, Hongkui Deng, Stuart H. Orkin, Peter J. Park 
& George Q. Daley 


Nature 525, E6-E9 (2015); doi:10.1038/nature15513 


During extensive revisions of this BCA, we inadvertently omitted 
a citation by Takaho A. Endo that used variant calls from RNA-seq 
data to conclude that the purported Fgf4-induced stem cells (FI-SCs) 
described in Obokata et al.' constituted a mixture of trophoblastic and 
embryonic stem cells”. Our analysis, performed independently, reached 
similar conclusions. We regret this oversight. 


1. Obokata, H. et a/. Bidirectional potential in reprogrammed cells with acquired 
pluripotency. Nature 505, 676-680 (2014). 

2. Endo, T. A. Quality control method for RNA-seq using single nucleotide 
polymorphism allele frequency. Genes Cells 19, 821-829 (2014). 
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LIVING FACTORIES OF 
THE FUTURE 


Scientists are designing cells that can manufacture drugs, food and materials — and even 
act as diagnostic biosensors. But first they must agree ona set of engineering tools. 


Japanese company Spiber Inc. has reprogrammed bacteria to make spider silk, which is being used to make clothing. 


BY MICHAEL EISENSTEIN 


no business producing a pain killer. But 

by re-engineering the microbe’s genome, 
Christina Smolke at Stanford University in Cali- 
fornia has made it do precisely that. Smolke and 
her team turned yeast into a biofactory that, by 
starting with sugar as a raw ingredient, makes 
the potent pain-relief drug hydrocodone’. 

This feat is a prime example of synthetic biol- 
ogy, in which scientists reprogram cells to repli- 
cate products found in nature — or even make 
more-specialized materials that would never 
normally be produced by a natural organism. 

Synthetic biologists are ambitious. “We'd 
all love to imagine a world where we could 
adapt biology to manufacture any prod- 
uct renewably, quickly and on demand,” 


he an evolutionary perspective, yeast has 


says Michael Jewett, a synthetic biologist at 
Northwestern University in Evanston, Illinois. 
Groups around the world are engineering yeast, 
bacteria and other cells to make plastics, bio- 
fuels, medicines and even textiles, with the goal 
of creating living factories that are cheaper, sim- 
pler and more sustainable than their industrial 
counterparts. For instance, the biomaterials 
company Spiber Inc. in Tsuruoka, Japan, has 
reprogrammed bacteria to churn out spider silk 
for use in strong, lightweight winter clothing. 
But synthetic biologists are going beyond 
simply producing materials — they are creating 
complex systems by ‘wiring up’ genetic parts 
into circuits. This approach has already resulted 
in various living switches and sophisticated 
sensors. For example, Martin Fussenegger’s 
group at the Swiss Federal Institute of Tech- 
nology (ETH) in Zurich has built biomedical 


sensors that can detect disease-relevant metab- 
olites in the blood and trigger the production 
of therapeutic compounds. In mice, these bio- 
sensors successfully staved off gout and obe- 
sity, and treated the skin disease psoriasis’ (see 
‘Living pills’). 

This young field has already spawned some 
success stories, but making and putting together 
genetic parts currently involves substantial 
guesswork and unpredictability. For the field to 
advance, academics and industrial players must 
agree on a toolbox of reliable genetic parts and 
the best strategies for assembling them. 

To build an artificial product, synthetic 
biologists begin by selecting DNA parts on 
a computer and manufacturing them with 
specialized instruments. The parts can then 
be inserted into the DNA of microorganisms 
and cells to reprogram them. 
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UAH OREN SYNTHETIC BIOLOGY 


» Thanks to the plummeting cost of DNA 
sequencing, there is now a vast collection of 
genetic data through which synthetic biolo- 
gists can sift to find useful genes. “Biology has 
given us this big, crazy library of stuff to choose 
from,’ says Christopher Voigt, a synthetic biolo- 
gist at the Massachusetts Institute of Technology 
(MIT) in Cambridge. One leading database, the 
US National Center for Biotechnology Informa- 
tion’s GenBank, contains more than 190 million 
DNA sequences from 100,000 organisms. 

Some of the most widely used genetic parts 
encode enzymes — proteins that are essential 
for manufacturing. To transform glucose into 
hydrocodone, for example, Smolke’s team 
took 23 enzyme-encoding genes from diverse 
species and put them into yeast’. 

Other favourites in the genetic designer's 
palette are promoters — stretches of DNA that 
regulate the activity of nearby genes and cause 
them to be expressed. When proteins called 
transcription factors bind to a promoter, the 
process of transcribing a gene begins. But pro- 
moters operate too slowly for some synthetic- 
biology applications. “We're trying to build 
things that operate fast — on millisecond time- 
scales,” says biologist Pamela Silver of Harvard 
Medical School in Boston, Massachusetts. 
Scientists are therefore examining alternative 
mechanisms that allow gene expression to be 
controlled directly by signals in the environ- 
ment, such as toxins or antibiotics. 

With myriad synthetic DNA pieces at their 
disposal, synthetic biologists can indulge their 
creativity. Voigt is enthusiastic about the pos- 
sibilities: “The nice thing about biology is that 
there are lots of ways to do the same thing — 
and as an engineer, you can pick the way that 
is easiest to design.” But genetic parts must 
perform consistently if the goal of setting up 
industrial processes is to be realized. “One of 
the key problems for biology in general is the 
lack of reproducibility,’ says Richard Kitney, 
chairman of the Institute of Systems and Syn- 
thetic Biology at Imperial College London. “In 
synthetic biology this is totally unacceptable — 
you have to have reproducibility if you're going 
to do industrial translation.” 

Many researchers deposit their discoveries 
into shared repositories, such as the Registry 
of Standard Biological Parts and the Inventory 
of Composable Elements. But those parts are 
often poorly defined or lack crucial informa- 
tion about how they were experimentally 
tested. “The only quality control is in the per- 
son who deposited the information,’ says Voigt. 

The US National Institute of Standards and 
Technology (NIST) launched the Synthetic 
Biology Standards Consortium in March 2015 
with the aim of standardizing the design, docu- 
mentation and assembly of synthetic-biology 
parts across academic institutions and industry. 
In the United Kingdom, Kitney is coordinating 
a similar effort in which the DICOM (Digital 
Imaging and Communications in Medicine) 
standard for sharing medical information will 


LIVING PILLS 


Scientists have made engineered cells that can detect flare-ups of the skin disease psoriasis and dispense 
on-the-spot treatment. The implanted cells are housed in gel capsules that protect them from the host's 
immune system and that stop the cells from attacking the host if they malfunction. 


Immune cell 


Pro-inflammatory 
cytokines (TNF, IL-22) 


if PSORIATIC LESION 


Skin cells multiply excessively, 
attracting immune cells to the 
site of inflammation, where they 
produce the pro-inflammatory 
cytokines tumour necrosis factor 
(TNF) and interleukin-22 (IL-22). 


Cell 


be expanded to include synthetic biology’. In 
parallel, an international team has developed 
SBOL (Synthetic Biology Open Language)’ to 
provide researchers with a standardized vocab- 
ulary to describe genetic parts and circuits. 


CELLULAR SOFTWARE 

Thanks to greater automation, it is now simpler 
and cheaper than ever before to make synthetic 
DNA parts (see ‘Manufacturing DNA has never 
been easier’). But connecting those parts to form 
genetic circuits that can work together to pro- 
vide sophisticated, computing-like behaviours 
is still a challenge. “Any time you physically 
connect DNA youre creating a new sequence 
at that interface — as DNA is so information- 
rich, you could create a new promoter or change 
the beginning of the RNA,’ says Voigt. 

Even carefully designed circuits can mal- 
function and cause unwanted expression ofa 
gene or interference between the genetic ele- 
ments in the biological circuit — outcomes 
that cannot be foreseen in computer models. 
“The community is very much operating in a 
world where we cannot predict what is going 
to happen in our systems when we build 
them,” says Reshma Shetty, co-founder of the 
synthetic-biology company Ginkgo Bioworks 
in Boston, Massachusetts. 

This uncertainty means that many of the 
steps in engineering a synthetic system need 
to be tested and optimized. Software tools 
and robotics are speeding up each part of this 
process, from building the artificial DNA to 
inserting it into a microbe. “You can use high- 
throughput prototyping to just build every 
variant, and hopefully one of them will hit? 
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Genetic circuit 


Yq DESIGNER CELL 


Implanted engineered cells constantly monitor the blood for 
signs of disease. The cell’s genetic circuit acts as an AND 
gate: when it detects both TNF and IL-22, the cell releases 
IL-4 and IL-10 into the blood to soothe the inflammation. 


Gel capsule 


Anti-inflammatory cytokines 
(IL-4, IL-10) 


Pore 
KE RESTORED SKIN 


The inflammation subsides, the 
designer cell becomes inactive, 
and the skin returns to normal. 


says Jay Keasling, a biochemical engineer at 
the University of California, Berkeley, and a 
pioneer in the field. The push for automation 
has led a number of synthetic-biology research 
centres and firms to install ‘biofoundry’ facili- 
ties in which robotic assembly lines create, test 
and optimize microbes at a much larger scale 
than could be done by hand. 

Biofoundries are enabling synthetic biolo- 
gists to embark on ambitious projects. For 
example, Voigt, who co-directs the MIT-Broad 
Foundry, cites a collaboration with the Swiss 
pharmaceutical company Novartis to manu- 
facture a huge range of molecules that are pro- 
duced by bacteria in the human gut. 

Other institutions pursuing the biofoundry 
model include the SynbiCITE programme 
at Imperial College London and the National 
University of Singapore's Synthetic Biology 
Foundry. The US Defense Advanced Research 
Projects Agency (DARPA) has also invested 
heavily in the MIT-Broad facility, including a 
five-year, US$32-million contract that began in 
October 2015. 

Some biologists remain sceptical about the 
rush to scale up and automate, and favour a 
more theory-driven strategy. But Kitney, who 
co-directs SynbiCITE, considers automation 
to be an inevitable step in the evolution of syn- 
thetic biology. “You can rapidly run a whole 
series of experiments in parallel to see which 
configuration works best,’ he says. 


THE PERFECT HOST 

Species that are commonly used as model 
organisms in the lab, such as brewer’s yeast 
(Saccharomyces cerevisiae) and the bacterium 


ADAPTED FROM REF. 2 


ROY KALTSCHMIDT/BERKELEY LAB 


Escherichia coli, have also been pressed into 
service by synthetic biologists. Many break- 
throughs in biosynthesis have been achieved 
with these organisms, such as when Keasling 
and his collaborators at Amyris, a company 
that he co-founded in Emeryville, Califor- 
nia, in 2003, reprogrammed S. cerevisiae to 
manufacture the antimalarial compound 
artemisinin’. 

But these common lab organisms are not 
necessarily suited to being grown on an indus- 
trial scale. The hunt for better alternatives has 
led scientists to search in obscure places. “More 
and more labs are taking on arcane organisms 
— I think the S. cerevisiae and E. coli domi- 
nance is dropping,” says Voigt. 

In some cases, the choice organisms will be 
those that can withstand harsh manufactur- 
ing conditions, says Keasling. “Maybe you're 
producing something that’s toxic but volatile, 
so if you have an organism that can produce 
it at relatively high temperatures, you could 
boil it off while you're producing it.” Scientists 
are also testing whether it is possible to feed 
microbes with carbon sources other than sug- 
ars to make products. Synthetic-biology com- 
pany Intrexon in Germantown, Maryland, is 
working with bacteria that feed on methane, a 
cheaper and more efficient means for produc- 
ing carbon-based products than is sugar. 


MEDICAL CELLS 
When it comes to medical applications, syn- 
thetic biologists are engineering mammalian 
cells rather than microbes. Such designer cells 
could produce drugs in response to disease or 
take over certain physiological tasks in peo- 
ple with metabolic disorders such as diabetes. 
But engineering mammalian cells introduces 
anew set of challenges. “All the tools we have 
in yeast are just not there in mammalian cells,” 
says Smolke. “We don’t have as many promot- 
ers, or tools for regulation 


of gene expression or pro- “It will come 
tein modification” to the point 

The easiest cells to cul- wher e you 
tivate are tumour-like, Canjust 
immortalized cell lines, inexpensively 
which are inherently synthesize 
‘defective’ and there- ftheDNAyou 
fore not representative need.” 


of healthy tissues. Con- 

versely, tissue-derived primary cells are hard 
to cultivate and manipulate, and differences 
between cell types confound efforts to build 
toolkits that can be applied across the body. 
“Something that works in a kidney cell will 
not necessarily work in the lung or liver,’ says 
Fussenegger. To get around this, the ETH 
team is engineering ‘prosthetic gene circuits, 
which are introduced into host cells that can 
be implanted at the site of disease. 

Tinkering with genomes can also pre- 
sent problems. Even ‘smart’ genome-editing 
tools — such as CRISPR-Cas9, a system for 
introducing targeted modifications at specific 
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GENES TO ORDER 


Manufacturing DNA has never been easier 


In the early days of synthetic biology, 
scientists had BioBricks — a genetic-part 
format that was conceived by Thomas Knight 
at the Massachusetts Institute of Technology 
in Cambridge and designed for modular 
assembly. It was an attractive concept, but 
stringing these short bits together to make 
larger circuits proved to be a laborious and 
potentially error-prone process. 

The task of assembling the pieces is 
now much easier because newer DNA 
synthesis machines can churn out strings 


arn 


Jay Keasling with a Biomek FX’ lab robot. 


DNA sites — can have unpredictable out- 
comes. “We don't know enough loci in human 
cells where you can put things in without any 
interference,’ says Fussenegger. His team is 
exploring whether it is possible to avoid this 
uncertainty by introducing gene networks 
that are embedded in synthetic loops of DNA 
known as plasmids rather than integrated 
directly into chromosomes. As an extra pre- 
caution, his experiments with mice gener- 
ally make use of engineered cells trapped in 
implanted capsules, rather than modifying the 
animal's tissues. 

Others want to do away with the cell alto- 
gether. Jewett is studying cell-free systems in 
which bacterial extracts are purified to obtain 
only the ‘useful’ parts of the cellular machin- 

ery. “You get all the enzymes necessary for 
energy and cofactor regeneration as well as 
protein synthesis,” says Jewett. “This gives you 
unprecedented freedom to directly manipulate 
reaction conditions.’ This allows researchers to 
establish chemical conditions that maximize 
manufacturing productivity without worrying 
about keeping cells healthy. Jewett’s team has 
shown that this approach can efficiently churn 
out medically useful proteins such as erythro- 
poietin’, a hormone that stimulates red-blood- 
cell production. “It’s not yet a replacement for 
existing technologies, but the yields are suffi- 
cient to serve as a complement,” he says. 


of several thousand base pairs rather than 
just a few hundred, which cuts down on the 
errors introduced by the assembly process. 
“You can just order a bunch of predictably 
designed constructs, so you don’t even have 
to think about the modularity anymore,” 
says Pamela Silver of Harvard Medical 
School in Boston, Massachusetts. The 
cost of making DNA parts has also fallen 
dramatically — by as much as 85% between 
2009 and 2014 — to the point at which 
both academic groups and companies 
routinely outsource the job to specialized 
providers such as Twist Biosciences in San 
Francisco, California, Gen9 in Cambridge, 
Massachusetts, and SGI-DNA in La Jolla, 
California. 
Many projects still require constructs 
that exceed the scale of what can be 
manufactured in one go, but these longer 
fragments can now be joined by quick 
and simple techniques that leave no scar. 
However, Jay Keasling of the University of 
California, Berkeley, thinks that even this will 
soon become a thing of the past. “It will come 
to a point where you can just inexpensively 
synthesize the DNA you need, whether it’s 
10,000 or a million base pairs.” ME. 


The field is still in its infancy — indeed, the 
earliest demonstrations of engineered genetic 
circuits appeared only in early 2000 — and it 
can be dauntingly complex. Even so, a growing 
number of scientists grounded in conventional 
molecular biology are keen to give genetic 
design a try. Synthetic biologist Ron Weiss at 
MIT is teaching an online course on the field 
that proves its popularity. “We’ve had about 
14,000 people sign up,” he says. 

The pay-off for those entering the field 
could be huge. “I’m in this space because the 
frontiers are endless for what biology can do,” 
says Shetty. “It’s just a matter of the technology 
advancing to a point where those new horizons 
open up.’ # 


Michael Eisenstein is a freelance writer based 
in Philadelphia, Pennsylvania. 
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RESEARCH DEVELOPMENT Advance cutting- 
edge science without running a lab p.406 
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Classes and seminars help trainees to get the most out of career-development tools. 


CAREER PLANNING 


Question time 


Career -development plans can point researchers in directions 
they might not have expected, but they take commitment. 


BY PAUL SMAGLIK 


workbook that asked about his scientific 

competencies and interests. When he 
indicated that he was skilled in statistical anal- 
ysis and enjoyed presenting research to a non- 
scientific audience, the programme suggested 
that he work in public policy, a field that didn't 
interest him. He tossed the results aside. 

But last year, the doctoral student, who is 
studying stem cells at New York University, 
revisited the tool as part ofa course on building 
career options. This time it led him to science 
writing, a path that resonated, and the instructor 


|: 2014, Michael Burel completed an online 


and guest speakers helped him to identify ways 
in which he could train for a career in the field. 
Since then, he has attended a science-writing 
seminar, talked to science journalists about 
how they trained for and landed their jobs 
and attended a science-writers conference. He 
now interns as a science writer for the Albert 
and Mary Lasker Foundation in New York 
City, which supports medical research. He says 
that the career workbook and related course 
have been some of the most useful aspects of 
his graduate education: together, they helped 
him to identify a viable career and guided him 
to the workshops, classes and internships that 
provided a starting point for his success. 


Burel’s experience illustrates both the 
promise and problems of the career-develop- 
ment programmes known as individual devel- 
opment plans (IDPs) in the United States and 
researcher development frameworks (RDFs) in 
the United Kingdom and mainland Europe. The 
programmes, available in hard copy and online, 
aim to help trainees to identify what aspects of 
science they like best, match them with careers 
that incorporate their interests and skills and 
identify gaps in their competencies. 

Versions of the programmes can be as elabo- 
rate as the multi-question workbook that Burel 
initially turned to, or as simple as a short con- 
versation with an adviser followed up by a writ- 
ten training plan. Either way, IDPs and RDFs 
can lead users in wrong directions and to dead 
ends when completed on their own or without 
follow-up. Junior scientists who hope to exploit 
the value of a career-development plan should 
complete them as part of a career-building 
course, discuss them with peers and an adviser 
and revisit them often (see ‘Career planner’). 

Although IDPs are commonplace in the 
private sector, the scientific workforce has 
adopted them fairly recently. Vitae, a UK-based 
organization that trains and develops research- 
ers around the world, developed an RDF in 
2009. In 2013, the US National Institutes of 
Health (NIH) recommended that principal 
investigators (PIs) use them with their post- 
docs and graduate students. The Federation of 
American Societies for Experimental Biology 
(FASEB) later created a template for the hard- 
copy version, and the American Association for 
the Advancement of Science (AAAS) launched 
an online version called myIDP. Many institu- 
tions have developed their own version. 


BREAK OUT OF BIAS 

Some trainees say that their institution’s IDP 
programme is written to aim users towards 
academia. And, they warn, ifa PI or adviser is 
not on board with other career choices, it can 
be tricky to get effective results from the pro- 
gramme. Gary McDowell, a postdoc at Tufts 
University in Medford, Massachusetts, com- 
pletes an IDP every year and discusses it with 
his PI. He says that both the programme and 
his PI are academically oriented, so he tries to 
be realistic about the plan’s empirical value. 
“Tt’s helpful to figure out conferences to go to, 
papers to plan, skills I need to be developing,” 
he says. “Ultimately, I think any reflection on 
your career goals, identifying successes in the 
past year and planning what you need in the 
next year is helpful, regardless of how you > 
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CAREER PLANNER 


How to get started on finding the right path. 


If you are a student or a postdoc and your 
mentor is unwilling or unable to help you 
with an individual development plan (IDP) 
or researcher development framework 
(RDF), here are some tips to help you get 
started and to follow through. 

@ Use an online tool such as mylIDP 
(http://myidp.sciencecareers.org) or 

RDF Planner (https://rdfplanner.vitae. 
ac.uk). These programmes work best with 
follow-up from a mentor, but the software 
can still help to identify scientific strengths 
and professional preferences and suggest 
possible careers, and it offers articles on 
how to train for them. 

@ Meet regularly with peers to collectively 
discuss your IDPs. Peers can often identify 
and offer suggestions about each other’s 
interests and transferable skills and may 
be able to point to training resources. 


> feel about your career path. But people may 
fall through the cracks.” 

There is little formal incentive to complete 
an IDP or RDE The NIH does not follow up 
on its recommendation, and not all institu- 
tions require trainees to use one. Nor is there 
a mandate for its use in any nation. Just 47% of 
the postdoctoral offices that participated in a 
survey by the US National Postdoctoral Asso- 
ciation said that they require their postdocs to 
complete an IDP, according to a 2014 report 
(see go.nature.com/awsupm). And another 
37% encourage their use, the report said. 

Ultimately, the trainee should not just 
create, but also follow through on their career- 
development plan, says Philip Clifford, asso- 
ciate dean for research at the University of 
Illinois at Chicago, who helped to develop 
both the FASEB and AAAS versions. The big- 
gest mistake users make is to consider it an 
endpoint rather than a launch pad, he adds. 
He has run some 200 career-building courses 
and workshops that incorporate the plan and 
build on its use and results. 

Lina Dimberg, who participated in one of 
Clifford’s courses, followed the instructions 
with care and found the programme fruitful. 
Asa postdoc in cancer research at the Univer- 
sity of Colorado Denver, she knew that she did 
not want to stay in academia but was unsure of 
her options. She completed an IDP with Clif- 
ford’s guidance and immediately learned that 
her strongest skills — writing grant proposals 
and papers, reading the literature and discuss- 
ing research — gave her a solid foundation for 
several science-related careers, one of which 
was medical writing. 

Clifford’s curriculum required her to set up 
meetings with scientists in occupations that 


@ Develop and tap into a broader network. 
This can help when students and postdocs 
suspect that their supervisor or mentor 
might be biased towards an academic 
career path. Attend local and regional 
meetings of professional organizations 

in your desired career area. Find alumni 
from your institution who work in the field 
(the university alumni office can help), and 
contact them. 

@ Commit to your plan. An IDP is useless 

if it sits on a shelf or in your digital device. 
Take the steps it identifies — building skills 
and expanding training through courses, 
seminars and workshops — to fill in gaps. 
@ Return to your plan regularly. Check on 
your progress and update any skills or 
experiences you've gained that could help 
in your career search. Set new goals and 
create deadlines for them. P.S. 


the IDP had pinpointed as career possibilities. 
One of those chats led to a job as a writer at 
a medical-device company after her postdoc 
ended; today, she works there as a senior scien- 
tist. Dimberg credits the IDP process and the 
workshop that supported it for helping her to 
define her career objectives and to develop the 
necessary confidence to market herself for the 
position. “The IDP opens your eyes to careers 
where you can combine your science interest 
with other interests and skills,” she says. 


ALL IN THE TIMING 

Sometimes, the programme might identify 
a good direction, but the user might not 
be quite ready at that point in time. When 
Nathan Vanderford was a graduate student 
and postdoc, his PIs encouraged him to com- 
plete an IDP that highlighted a tenure-track 
position — even though he wasn't sure that 
was the right route. “I ended up with a plan 
that I felt was not true to my desired career 
path,” he says. He then spent years in other 
careers, including science communications 
and research operations. 

Today, he has come almost full circle and 
is now a faculty member at the University of 
Kentucky in Lexington, where he hasa faculty- 
administrator post that his IDP results from 
so long ago did not quite predict. He teaches 
a career-development class that incorporates 
the programme's best principles. “The IDP I 
was forced to do has little to do with my cur- 
rent position,’ he says. “I want to give students 
a mechanism that allows them to explore 
freely any career option they want to pursue?” m 


Paul Smaglik is a freelance writer in 
Milwaukee, Wisconsin. 
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TRADE TALK 
Funding fixer 


After a PhD 

and postdoc in 
cardiovascular 
biology, Christina 
Papke moved into 
research development. 
She works at Texas 
Ae&M University in 
College Station, where 
she assists biomedical 
faculty members with grant proposals and 
helps them to form collaborations and identify 
funding opportunities. 


When did you start exploring careers beyond 
running a lab? 

The first and most difficult step was to realize 
that I did not want to be a principal investiga- 
tor. In summer 2014, a series of both difficult 
and good events — a grandparent'’s death, a 
friend’s wedding, a crime in my apartment 
complex — left me feeling like a rubber bouncy 
ball emotionally. It caused me to think, pray 
— and re-evaluate the direction of my career. 
I realized that although I enjoyed thinking 
about science, I could not see myself working 
ina lab for the next 30-plus years. 


How did you learn about your current job? 
What was extremely helpful for me was join- 
ing the American Medical Writers Associa- 
tion (AMWA) and being willing to put myself 
out there at a conference, not knowing who I 
might meet. Two people presented on being a 
grant-writing professional. That required a solid 
research background, as well as understanding 
how grants work, communicating with scien- 
tists, coordinating events and being an informa- 
tion resource. That was the combination I didn't 
know I was looking for. I realized that I could 
take my favourite aspects of research with me. 


So the session clinched it for you? 

I wouldn't have found this job if I hadn't talked 
to the presenters. One of them said, “I knowa 
place.” It was a city I hadn't been looking in. I 
joined AMWA in February 2015, went to the 
conference in April and was hired in July. 


That sounds almost preordained. 

One of my mentors said to me, it’s like put- 
ting out a lot of fishing lines. I was also doing 
informational interviews and a variety of other 
things. But during the process, it looked quite 
messy. My advice is, sometimes you don't know 
what you're looking for. And that’s okay. = 


INTERVIEW BY MONYA BAKER 
This interview has been edited for length and clarity. 
See go.nature.com/kbvz2q for more. 
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Ua SCIENCE FICTION 


BY S. R. ALGERNON 


t first, I mistook the skittering 
Axis outside my tent for thun- 

der. I jammed my pillow over my 
ears and returned my attention to the 
geosurvey data on my phone. After two 
weeks on this backwater planet, I didn't 
have even half the data I needed. I had 
timed my visit to coincide with an 
alignment of the planet’s three moons. 
According to Dr Feldman, that would 
set the atmosphere abuzz and be the 
perfect test of my hypothesis that the 
crystalline formations on the planet's 
surface were a product of the planet’s 
mesmerizing storms. 

Nobody told me that, for the locals, this 
was Synergy Month. The whole village was 
full of clanging and the squealing of pupae. 
The weather had cut me off from the orbital 
communications hub, so I couldn't even 
watch through the orbiting satellites. Worst 
of all, the locals who were supposed to help 
me carry equipment up to the mountains 
turned me away, even when I doubled the 
price. We cannot, they said. The Spirit-Storm 
gathers. The Queenling beckons. 

Explain that to my thesis committee. 

With only a few more days on-planet, I 
found a tent and set off for the mountains 
myself. I couldn't carry a full complement 
of scanners, but I could make do with the 
apps on my phone and hope to clean up the 
data afterwards. 

Outside, lightning struck close enough to 
cast shadows of exoskeletons and segmented 
limbs against the walls of my tent. A spiny 
leg tore through its fabric and impaled a cor- 
ner of my sleeping bag. 

I wriggled free, stuffed my phone into a 
trash bag and managed an insectoid version 
of running with the bulls until I lunged onto 
the rocks by the roadside. 

That was close. I recalled a quip from 
Dr Soto's Xeno-archaeology class. Out on 
the frontier, the bugs step on you. 

The insectoid that had ruined my tent 
extricated its leg and weaved through the 
crowd towards me. 

“Apologies, ’ it said. “I hope I did not injure 
you.” 

“Just a scratch or two.” 

“You should join 


> NATURE.COM the procession. The 
Follow Futures: Spirit-Catcher will 
© @NatureFutures find a spirit that will 
Ei go.nature.com/mtoodm = heal your wounds.” 
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“Td rather take my chances with the clinic 
at the spaceport;” I said, as I dusted myself off. 

“Spaceports come and go. The Spirits will 
always be with us,” said the insectoid pil- 
grim as it rejoined the procession. “Have a 
safe journey” 

Iclimbed up the hillside and watched the 
storm. Now free from the distracting foot- 
falls and echoes down below, I noticed that 
most of the lightning arced towards a ridge 
in the distance. I trudged to the top of the 
ridge, and it turned out to be the weathered 
edge of an impact crater. Lightning had 
scorched and melted the rock, leaving it 
alternately pockmarked and slick. 

I photographed the crater surface. Three 
more strikes etched a radiating network 
of lines onto my vision through my closed 
eyelids. My skin tingled. My ears rang. The 
lightning pooled or collected somehow in 
the crater. Why would lightning seek low 
ground? Was something catching it? 

Maybe I could get a dissertation out of 
that Spirit-Catcher nonsense after all. 

Iaimed my phone at the glowing spot in 
my field of vision and stepped forwards. The 
slope was steeper than I thought, but I held 
my ground. I felt eerily calm. All my training 
told me to get clear, but the crater drew the 
energy to its centre. After recording a dozen 
more strikes, I turned to walk back to the trail. 

The mud slipped underfoot. I fell to one 
knee and pushed off with my hands. As I 
regained my footing, strands like spider silk 
clung to my palms and fingertips. 

Lightning flashed once more. The 
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strands glowed. Every muscle in my body 
convulsed and I tumbled backwards. As my 
head struck the ground, pain sent me into 
a nauseated daze. As I fell, fibres gathered 
into a cocoon around me — a shimmering, 
silvered Faraday cage. 


I awoke at the bottom of the crater, 
hanging from a web by strands that 
spanned the crater floor. They sup- 
ported my weight like marionette 
strings. The cocoon enveloped each 
leg, arm and finger individually, so it 

did not encumber my movement. 

Lightning struck. The strands of the 
thick web above me glowed as the current 
passed overhead. 

A rustling sound caught my ear, and I 
turned to see a twelve-legged arachnoid 
shape emerge from an alcove in the shad- 
ows. As it approached, my arms pulled 
tight against my chest in a dead-Pharaoh 
pose. The strands connecting me to the web 
pulled taut, lifting me off the ground. 

“You must be the Spirit-Catcher; I said, 
grateful that the cocoon permitted me to 
speak. 

“When the others arrive,” said the Spirit- 
Catcher, “do as I instruct you and do not say 
a word.” The cocoon tightened around my 
neck and then released. “I will know if you 
try to deceive me.’ 

The Spirit-Catcher inspected the corners 
of the web, which held hundreds of battered 
machines and dozens of non-human skel- 
etons. 

The web that confined me disgorged a 
dented, blood-flecked medi-bot, which 
rolled towards me. 

“You can use this, yes?” 

I nodded. All the field researchers had to 
learn as part of their training. 

The Spirit-Catcher seemed satisfied and 
scuttled back to the alcove. The lightning 
stopped. I heard a regal voice echo through 
the chamber. 

“All praise the Spirit-Catcher. Bring forth 
the injured and infirm.” 

I told myself they would let me go once 
the procession had left, but the pilgrim’s 
words stuck with me. 

“Spaceports come and go. The Spirits will 
always be with us.” = 


S. R. Algernon studied fiction writing 

and biology, among other things, at the 
University of North Carolina at Chapel Hill. 
He currently lives in Singapore. 
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he world is in the midst of the largest wave of urban 

growth in history. With more than half the population 

already living in cities, and further rises expected, the 
health of city dwellers is crucial to global well-being. 

Throughout history, many of the innovations designed to 
improve city living have instead posed a challenge to having a 
healthy, happy life (page S50). Urban environments designed 
around the car, for instance, promote a sedentary lifestyle (S52). 

Faced with shifting climates, urban policymakers are 
preparing to deal with an increased risk of flooding (S54) — the 
replacement of concrete with more absorbent surfaces may be 
part of the solution. Evidence is also building that green space, 
although scarce, is a valuable resource for improving mental 
health — especially in poorer communities (S56). 

Health inequality is the bane of many countries. A community 
project in California shows how tackling stress in deprived 
neighbourhoods can improve the health of people whose life 
expectancy often lags behind that of their richer neighbours 
(S58). Although improved living standards have reduced the 
toll of infectious disease over the years, the world’s poorest 
urbanites remain vulnerable. From cholera to Zika virus, slums 
provide the ideal place for pathogens to thrive (S61). 

Interventions to enhance urban living, such as city farming, 
are not without their pitfalls (S60). Although navigating the 
risks of urban improvement will not be easy, it is essential if we 
are to better the lives of the urban population (S64). 
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Sciences; Zhongke DingShi Environmental Engineering Co., 
Ltd; Ningbo Tianhe Aquatic Ecosystems Engineering Co., 
Ltd; and Huilv Landscape Construction Development Co., 
Ltd in producing this Outlook. As always, Nature retains 
sole responsibility for all editorial content. 
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Neolithic people in Meso- 
potamia, now roughly 
Iraq, Kuwait and Syria, 
swap nomadic hunter-gatherer 
lifestyles for life in villages, near 
which they grow crops and keep 
livestock. These settlements attract 
vermin, insects and parasites, and 
the denser populations that they 
support encourage infectious 
diseases to emerge. Many ani- 
mal pathogens, including 

4 those that cause tubercu- 
losis and smallpox, make 
the jump to humans. 


~10,000 Bc 


~6000 Bc 


Recognizable cities appear in Mesopotamia (pictured) on the floodplains of the rivers Tigris 
and Euphrates. Other cities follow: in the Indus Valley, now Pakistan; on the banks of the 
River Nile in Egypt; and in the east along the Yellow River in China. As they grow, so does 
the production of waste. Contaminated and stagnant water trigger outbreaks of disease. 


~3200 Bc 


Shahr-e Sukhté — the Burnt City — in southeast Iran 
installs a system of pipes to supply clean water as well as 
sewers to discharge waste. In the Indus Valley, the grid- 
based cities of Harappa and Mohenjo Daro manage 
waste with drains in the streets and brick-lined sewers 
(pictured). Most houses have a private well and toilet. 


100 


The population of Rome reaches 1 million. According to the Greek geographer Strabo, while the 
Greeks build beautiful cities, the Romans focus on “paving their roads, constructing aqueducts, and 
sewers.” Aqueducts provide more than 1,000 litres of water per person, per day — far beyond the 
amount that people use today. Yet hygiene is poor, diseases are rife and mortality is high. 


1348 


Bubonic plague — the Black Death — sweeps across Europe. The pandemic: [ 

prompts many city authorities to restrict the movement of residents. 
In Italy, the cities of Venice, Florence and Lucca set up boards that i e 
have the power to impose — and enforce — quarantine. : pu al : 


ANCIENT METROPOLIS 


The city of Caral takes shape in the 
Supe Valley, Peru on the slopes of the 
Andes mountain range. The first major 
city in the Western Hemisphere, Caral 
boasts massive pyramids and exten- 
sive residential complexes that cover 
an area of 65 hectares. 


FLUSHED WITH PRIDE 


Tarquinius Priscus, the King of Rome, 
orders the construction of the Cloaca 
Maxima (pictured), or the Greatest 
Sewer, to drain the marshes that sur- 
round the city and to flush waste into 
the River Tiber. Over the centuries, the 
sewer is expanded, and it is still in use. 


DIN CITY 


The Roman satirical poet Juvenal 
writes of the “thousand perils” that 
are faced by those who live in ancient 
Rome (pictured). These include 

fire, collapsing 
buildings — 

and insomnia 
caused by night- 
time traffic. 

He states that 
the noise from 
passing wagons 
would “rouse a 
dozing seal — or 
an emperor.” 


FIERY WORDS 


English diarist John Evelyn presents 
King Charles II with Fumifugium — a 
discussion of air pollution in London, 
in which he likens the city to the “sub- 
urbs of hell”. Evelyn suggests solutions 
such as a switch to cleaner fuels. 
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MASS 
MOVEMENT 


London opens 
the world’s first 
underground rail- 
way line (pictured), 

which uses coal-fired 
steam trains. The city soon has a net- 
work of subterranean electric lines 
and trains that form part of the earliest 
mass-transport system. 


UPWARDLY MOBILE 


US architect William Le Baron Jenney 
builds the first skyscraper in the world, 
the ten-storey Home Insurance Building 
(pictured) in Chicago. Its metal frame 
allows tall struc- 
tures to be built 
without the need 
for thicker walls 
— an innovation 
that enables 
high-density city 
living. 


GREEN SHOOTS 


The UN Earth Summit in Rio de Janeiro, 
Brazil, pushes sustainability and climate 
change up the political agenda — which 
prompts leaders to take action. Plant- 
ing trees and creating green spaces are 
proposed as ways to soak up excess 
rainwater and to mop up pollutants. 


DEAD HEAT 


Europe experiences its hottest summer 
for 500 years, which kills an estimated 
70,000 people. Those in cities are the 
most affected as a result of an urban 
heat-island effect, and morgues run out 
of space as the death toll rises. 


TIPPING POINT 


More than half of the world’s popula- 
tion now lives in urban areas. In more- 
developed nations, the figure is around 
74%; in less-developed ones, 44% live 
in cities. By 2100, 85% of the world’s 
population is expected to be urban. 
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The Industrial Revolution triggers the first large wave of urbanization. Across Europe, rural 
dwellers flock to the cities that are developing around centres of manufacturing. The popula- 
tion of London soon reaches 1 million and continues to grow as more migrants come in search 
of work. Cramped housing and poor sanitation lead to epidemics and high mortality. 


Cholera rages through London's Soho district, and the outbreak is traced to a well 
that is contaminated by sewage. London’s sewage problem persists until the sum- 
mer of 1858 when the stench of the River Thames reaches Members of Parlia- 
ment in the Palace of Westminster. A network of sewers and pumps designed by 
visionary engineer Joseph Bazalgette (pictured) is hurriedly approved. After its 
completion in 1875, London never experiences another cholera epidemic. 


People converge on the World’s Columbian Exposition in Chicago, Illinois, to admire the White City. 
The architectural exhibit embodies the ‘city beautiful’ concept, which proposes that decaying urban 
centres are replaced with classical architecture, parks and lakes to remedy social unrest and crime. In 
the United Kingdom, the garden city movement also advocates green spaces, fresh air and walking. 


A thick haze that irritates the lungs settles over wartime 
Los Angeles, California. The cause remains unknown until 
Arie Haagen-Smit, a chemist at the California Institute of 
Technology in Pasadena, solves the mystery — the haze 
consists mainly of ozone, which is formed by the action of 
sunlight on the emissions that spew from car exhausts. 


In February, a doctor from Guangdong in China spends a night in 
a hotel in Hong Kong. He is infected with the virus that causes 
severe acute respiratory syndrome (SARS), which spreads to 
other guests who then carry it around the world. By June, 

almost 8,500 people in 30 countries have contracted SARS. 


Brazil is experiencing its worst drought for 80 years. Sao 
Paulo, a megacity of almost 21 million people, starts to 
ration water as reservoirs dry up — and blackouts occur as 
hydroelectric power stations shut down. Water hoarding in 
open tanks brings mosquitoes and with them dengue fever. 
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The urban 
downshift 


Transporting people around the cities of the future is a 
public-policy challenge, but it’s also an opportunity to 
improve the health of urban populations. 
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BY SARAH DEWEERDT 


any cities around the world are 
M spreading faster than their popu- 

lations are growing. According to 
researchers at New York University, between 
1985 and 2000 the population of Accra in 
Ghana increased by 50%, but its land area grew 
by 153%. People are having to travel further: in 
Nairobi, for example, the average commuting 
distance increased from less than 1 kilometre 
in 1970 to 25 kilometres in 1998. As this trend 
continues, governments face the problem of 
how to move people around ever-expanding 
metropolises efficiently enough that residents 
can take advantage of the opportunities — eco- 
nomic and otherwise — that cities have to offer. 

This issue is certainly a public-policy 
challenge, but it’s also an opportunity to improve 
the health of the world’s growing urban popula- 
tion. And researchers and public-health experts 
say that making cities of the future function well 
and support human health may depend on the 
most low-tech, ancient assets available — our 
own two feet. 

“Tf the pedestrian is happy and you see many 
pedestrians, that’s a city with a good transport 
system,’ says Clayton Lane of the Institute for 
Transportation and Development Policy, a non- 
profit organization based in New York. “The 
pedestrian is the indicator species for a sustain- 
able transport system” — and, as it turns out, 
for a healthy one. 

But getting there will require a major shift in 
government spending priorities and in public 
attitude. “In many cities around the world, the 
people and the politicians have this vision of 
modernity that prominently features automo- 
biles,” Lane says. As residents become wealthier, 
the urban infrastructure is remodelled to favour 
cars. Many cities in the developing world spend 
around 70% of their transportation budgets on 
car-oriented facilities, even though around 70% 
of trips take place on foot or by public trans- 
port, Lane says. The result is that the world is on 
track to have 2.3 billion cars by 2050. That's just 
over double the number that were on the road. 
in 2010, and it represents a major threat to the 
health of the urban population. 


ROAD TO HEALTH 
Cars promote a sedentary lifestyle, with its 
attendant risk of health problems such as 
obesity and heart disease. Driving, especially in 
congested traffic, causes stress, and air pollution 
worsens respiratory diseases such as asthma. 
According to the World Health Organization 
(WHO), 1.3 million people die from traffic 
accidents and 3.2 million from lack of physi- 
cal activity every year. Outdoor air pollution 
causes 3.7 million deaths annually, and land 
traffic is responsible for around 5% of deaths 
caused by fine particulate matter and ozone’. 
Cars are responsible for only a portion of that 
pollution (two-stroke scooters are thought 
to contribute a disproportionate amount to 
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urban air pollution), but are “the worst mode 
of transportation in terms of all these catego- 
ries,” says Jeff Speck, an urban planner and 
author of Walkable City (Farrar, Straus and 
Giroux, 2012). 

Much better is what researchers call active 
transport: walking and cycling, mainly. Sitting 
in traffic is stressful, whereas physical activity 
boosts happiness (see page S56). Fewer cars on 
the road means less choking pollution and fewer 
deaths in car crashes. Furthermore, the more 
that people walk and cycle in the city, the safer 
these activities become — both because there 
is safety in numbers and because cities provide 
infrastructure to accommodate these activities. 

In general, people are healthier when they 
are able to do most of their day-to-day activi- 
ties and errands on foot. For example, rates of 
childhood obesity are lower in more walkable 
neighbourhoods’. Among older adults in low 
income neighbourhoods, people living in walk- 
able areas have a lower body mass index than 
those living in areas where moving around on 
foot is difficult’. And moving from a low-walk- 
ability to a high-walkability neighbourhood 
decreases the risk of having high blood pres- 
sure*. “Most cities uniformly seek to improve 
mobility for their citizens, and I think that itself 
is worth questioning,” says Speck. “Mobility is 
often seen as the ideal, when in fact what we 
really want is access.” That means urban plan- 
ning needs to emphasize not just moving people 
around efficiently, but also making sure people's 
needs can be met nearby. 

Of course, walking and cycling are not by 
themselves sufficient to meet people's trans- 
portation needs, especially in the growing 
number of megacities (those with 10 million 
residents or more). But walking and public 
transport support each other. A walkable city 
needs good transport to move people around. 
By the same token, walkable neighbourhoods 
make transport systems more cost-efficient to 
build and help to ensure that they are well used. 
Some studies have found that access to public 
transport improves physical activity and health, 
largely because it gets people walking. 


DESIGN FLAWS 
A well-designed city can encourage habits that 
promote good physical health. “Walking is a 
very simple physical activity that most people 
can do,’ says Yan Kestens, who studies how the 
built environment contributes to public health 
at the University of Montreal, Canada. But mak- 
ing an environment more walkable can be chal- 
lenging — especially in cities that took shape 
after the advent of the car. “The physical struc- 
ture of our cities lasts for centuries,’ says Lane. 
“If we build our cities and suburbs for cars, it’s 
very difficult to retrofit them for walking” 
According to British geographer Adam 
Davies, who recently collaborated with 
researchers at Yahoo Labs on an analysis of 
7-million geotagged photos taken in central 
London, walkability is hugely compromised by 
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street networks designed around the car. “The 
more cars and the more lanes of traffic, the less 
human-friendly that particular street probably 
is, he says. 

There are many reasons that people choose 
not to walk to destinations that are within walk- 
ing distance, Davies says. Lack of pavements, 
inconveniently placed pedestrian crossings, 
and the need to cross a major thoroughfare, 
for example, can make walking unappealing 
or even unsafe. And for some older people, or 
anyone who has trouble walking, factors such 
as these can erode walkability surprisingly fast. 
“Tve heard of stories where people take a taxi 
to go across the street,” says Verena Menec, a 
healthy-ageing researcher at the University of 
Manitoba in Winnipeg. 

Unpicking the subtle barriers that drive peo- 
ple to this kind of extreme is tough — not least 
because people’ real-world behaviour is difficult 
to predict. Menec and her team’ asked middle- 

aged and older adults 


“Mobility is about their walking 
often seen as behaviour and attitude 
theideal, when _ to walkability. Nearly 
infact what we 60% said it was impor- 
reallywantis _ tant to have a grocery 
access.” store within walking dis- 


tance. But of this group, 
76% said that they drive there rather than walk. 
“If people are still within their ‘car mode they 
will probably not actually walk,’ Menec says. 

But some studies have failed to show clear 
links between walkability and better health. 
One analysis® of data from the Nurses’ Health 
Study — a large, long-term epidemiological 
study of women in the United States — for 
instance, found that women living in walkable 
neighbourhoods are exposed to higher levels of 
harmful air pollution. But this relationship var- 
ies in different parts of the country, suggesting 
that exposure to pollution isn’t inevitable. And, 
the more people that walk rather than drive, the 
cleaner the air will be. 

A similarly puzzling set of results comes from 
one of Kestens’ studies’, which found that low- 
income residents in Montreal were less likely 
to walk to places than wealthier people, even 
in parts of the city that were relatively good for 
walking. Kestens says that the field needs more 
qualitative studies to uncover the reasons why 
people do or do not choose to walk. He is also 
using GPS and wearable devices to more pre- 
cisely measure how people get around their 
cities. The goal is to use those insights to help 
design more walkable urban landscapes. 


FIRST STEPS 

Communities and urban planners around the 
world are coming up with creative ways to 
improve neighbourhood walkability. A grass- 
roots effort in the Indian city of Chennai, for 
example, is addressing conditions faced by 
many of the world’s poorest urban dwellers. 
They have no choice but to get around on 
foot; however, they do so on streets that are 


not particularly good for walking. “We have 
city after city where many people are walking, 
yet the city is not walkable at all,” Lane says. In 
such environments, pedestrians are especially 
vulnerable to injury and death from traffic acci- 
dents, according to the WHO. 

The Chennai government has committed to 
spending at least 60% of the city’s transportation 
budget on measures to encourage walking and 
cycling. By 2018, the city is aiming to make 80% 
of its roadways ‘complete streets’ — wide pave- 
ments, bike lanes, space for public transport and 
organized parking, as well as lanes for cars. 

Another piece of the puzzle is developing 
public transport systems to link walkable neigh- 
bourhoods that are within the reach of cities, 
using scarce financial resources. Bus rapid 
transit (BRT) has emerged asa practical, afford- 
able solution for many cities, says Lane, whose 
organization wrote a set of BRT standards, 
because BRT lines are much faster and cheaper 
to build than rail-based systems. Yet, they are 
fast and efficient — they have dedicated lanes, 
preferential treatment at intersections, and plat- 
forms to help people board faster. 

Curitiba in Brazil built the world’s first BRT 
network in the 1970s, with the intention of 
concentrating urban development around 
bus stops along the route — a planning tac- 
tic known as transit-oriented development. 
Although successful for a time, the city’s rapid 
growth eventually overwhelmed the capacity of 
their plan. Curitiba now intends to revisit the 
strategy with a new BRT line and an associated 
development corridor. 

One of the latest converts to the BRT 
approach is sprawling Accra, which is building 
a line between the suburb of Amasaman and 
the city centre. Until now, the city’s transport 
system has been dominated by licensed mini- 
buses known as tro tros, but these only service 
alittle over half the routes that they are licensed 
to run on. Accra is like many cities in the devel- 
oping world that lack a functioning mass trans- 
port system. “There's such a huge, huge deficit 
to address, but BRT is a good solution to do 
it quickly and affordably,” Lane says. “High- 
quality transit is key to a walkable city so you 
can access other parts of the city that are also 
walkable.” m 


Sarah DeWeerdt is a freelance science writer 
in Seattle, Washington. 
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Flooding at a station in Koshigaya near Tokyo. 


Water potential 


Water is anecessity for any city, but too much of it can 
threaten lives and infrastructure. As climate change looms, 
new approaches can help to turn a threat into a resource. 


BY JAMES M. GAINES 


ater is vital, but as anyone who has 
felt the effects of flooding will attest, 
it is possible to have too much ofa 


good thing. For many nations, flooding has 
always been a problem. Take the Netherlands, 
for instance: 26% of the country — and 21% of 
its population — is below sea level, and only a 
complex system of dykes, pumps and dams, 
constructed over hundreds of years, keep it 
dry. But in other countries, flooding is a more 
modern and growing concern — especially as 
absorbent rural land continues to be subsumed 
by the impermeable concrete of our rapidly 
expanding cities. Between 2008 and 2010, 
62% of Chinese cities were flooded. And 39% 
of 351 cities in the country reported at least 
3 serious floods in that time. 

In 2012, the heaviest rainfall in more than 
60 years killed 79 people in Beijing. Climate 
change is likely to make such events more 
common. According to researchers at the 


Massachusetts Institute of Technology in 
Cambridge and Princeton University in New 
Jersey, hurricanes of magnitudes previously 
seen only about once a century could hit land 
every 3-20 years by the end of the century’. 
And, in 2013, researchers estimated that the 
global financial impact of flooding in coastal 
cities would increase from US$6 billion per 
year in 2005 to $52 billion by 2050 (ref. 2). 
The threat climate change poses isn’t so 
much new as it is an amplification of existing 
flooding hazards; those already vulnerable will 
find themselves more at risk. And with cities’ 
high population density and often coastal 
locations — three-quarters of large cities are 
found on the coast — they represent a great 
accumulation of risk. More than half of the 
world’s population already lives in cities and 
towns, and with the global urban population 
set to reach 5 billion by 2030, a single flood has 
the potential to affect millions of people’ lives. 
How climate change is likely to affect the 
risks posed to cities by flooding is a question 
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that researchers, developers and city officials 
around the world are working to understand. 
Faced with the certainty of some degree of 
change, a few cities are already develop- 
ing infrastructure to allow them to control 
excess water — and potentially even turn it 
into a resource. 


RISING TIDES 

Kristina Hill, an urban designer at the 
University of California, Berkeley, is more 
familiar than most with the impact that flood- 
ing can have on an urban area. She worked in 
New Orleans as part of the Dutch Dialogues — 
a Dutch-American collaboration that brought 
together engineers, architects and designers 
after New Orleans was ravaged by hurricane- 
induced flooding in August 2005 to ensure that 
the city would be better prepared in the future. 

Hill is applying what she learned in New 
Orleans to the San Francisco Bay Area. Many 
parts of California are experiencing a historic 
drought, but she believes that in the near future 
coastal cities such as San Francisco will have to 
deal with flooding, caused not by falling rain, 
but by rising tides. 

Sea level in the Bay Area is projected to rise 
by around 1 metre by the end of the century. 
This has the potential to overwhelm defences 
and inundate beach-side areas, but Hill is con- 
cerned about another, subtler effect of sea-level 
rise — water bubbling up from underground. 

“One of the things most people have for- 
gotten, which the Dutch pointed out to me,” 
she says, “is the groundwater connection to 
seawater rise.” 

Seawater is constantly seeping inland, deep 
underground. Because of its greater density, 
the salty water tends to push underneath 
freshwater aquifers. The effect of this is that, 
when sea levels rise, the level of the fresh 
groundwater also moves up. 

This higher water table is likely to exacerbate 
flooding from heavy rains, Hill explains. Just 
as an already sodden sponge is less effective at 
mopping up liquid, so the ground loses some 
of its ability to soak up water from the surface. 
But even without a storm, if sea levels rise 
high enough, the fresh groundwater will be 
pushed to the point at which it begins to leak 
into underground structures, or up through 
the surface. “We'll end up with lots of flooding 
driven by groundwater,’ says Hill. 

The effect that sea-level rise could have on 
groundwater in the Hawaiian city of Honolulu 
could more than double the amount of flood- 
ing that the city will experience owing to 
inundation from the sea alone, researchers 
have suggested’. And, in Miami, Florida, peo- 
ple have reported groundwater bubbling up 
through their gardens at high tide. 

For Hill, the solution lies not in preventing 
sea-level rise, but in adapting to it. “We have to 
learn to live with our feet wet; she says. Water- 
proofing infrastructure such as the subterra- 
nean tunnels that carry essentials, including 
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gas pipes, electrical lines 
and communications cables 
in many cities, is required 
to prevent the disruption 
of power and phone lines 
becoming a regular occur- 
rence. Some infrastruc- 
ture, Hill says, may need 
to be relocated away from 
low-lying ground entirely. 
But in the face of sea-level 
rise and ever more frequent 
storms, she is less than con- 
fident that everywhere will 
be able to cope. Some dis- 
tricts will be abandoned, at 
least temporarily, she says. 
“Cities like Miami, Hono- 
lulu, even parts of New York 
— Queens, Brooklyn, the 
Rockaways, Staten Island. 
There are definitely places 
that are not ready.” 


SPONGE CITIES 


Storm-water 
tree trench 


Green roofing 


CAPTURE AND CONTROL 

An hour’s drive from 
Berkeley, on the opposite 
side of San Francisco Bay, 
is Stanford University — 
one of the participants in 
ReNUWIt (Re-Inventing 
the Nation’s Urban Water 
Infrastructure), a research 

group dedicated to the development of 

urban water management. Here, adaptation is 
the name of the game. Environmental engineer 
and director of the institute Richard Luthy is 
focused on how cities can control flooding, 
particularly that caused by storms. 

Conventionally, cities have dealt with storm 
water by trying to get rid of it as fast as possible, 
says Luthy. “That means building these concrete 
channels and conduits that take the storm water 
to the bay or ocean as quick as we can,” he says. 
If the rainfall is especially strong, however, the 
sudden volume of water can overwhelm these 
drainage systems and lead to flash flooding. 
Slowing down the flow of water through the 
system and diverting some away entirely can 
ease this problem — and possibly even turn 
troublesome storm water into a resource. 

Luthy is working with the Californian cities 
of Sonoma and Los Angeles to improve their 
resilience to flooding by constructing storm- 
water reservoirs. Instead of immediately 
shunting water away down concrete water- 
ways, these large basins will allow storm water 
to pool and collect. 

As well as reducing storm-water-related 
damage, the reservoirs make it easier to har- 
ness the water as a resource. Vegetation planted 
in and around the reservoir purifies the water 
because the pollutants become trapped in the 
soil or the roots, and contaminants are taken up 
by tissues or even broken down into less harm- 
ful substances. The cleaner water can then be 


Rain water is 
captured by 


plants and soil, 
allowing some to 
evaporate and the 
rest to be slowly 
released to drains. 
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Conventional city drains can be overwhelmed by a sudden influx of storm water. In cities such as 
Philadelphia, technologies that allow the streets, pavements and roofs to soak up and store water 
and release it at a manageable rate are being used to reduce the burden on existing sewers. 


1) A perforated pipe distributes 


Pervious paving 


Permeable alternatives 
to asphalt allow water to soak 
through the road, reducing the strain on 
the city’s drains. 


further purified and allowed to percolate into 
underground aquifers beneath the city, where 
it can serve as a drinking-water source during 
droughts. “We see storm water as an opportu- 
nity, not as a problem,’ said Luthy. 

A number of cities are including plants and 
vegetation in water-management plans. Phila- 
delphia’s Green City, Clean Water programme, 
adopted in 2011, uses a variety of techniques to 
control storm water, including reservoirs like 
Luthy’s. “There’s dozens of technologies you 
can employ,’ said Chris Crockett, deputy com- 
missioner of Philadelphia’s water department. 

One such tool is the tree trench: a deep ditch 
alongside the pavement, filled with absorbent 
fabrics, gravel and soil, and then planted with 
trees (see ‘Sponge cities’). As well as adding a 
welcome dash of greenery to the urban envi- 
ronment (see page S56), the trench acts as an 
intermediary step between the storm drain and 
the sewer, using the same principles of slowing, 
capturing and purifying runoff to protect the 
city. The advantage, however, is that the tree 
trench can be installed in areas where a full- 
size reservoir could never fit, such as along 
busy streets. 

Roads and streets are conventionally paved 
with materials such as asphalt or concrete, 
which are impervious to water and cause it 
to collect on the surface and flow into storm 
drains. But engineering can make a difference 


2) Trees take up and 
transpire some water 
held in the trench 


water into a stone-storage layer yaar 


here too. Philadelphia is 
experimenting with pervi- 
ous paving, which allows 
water to soak through a 
porous surface to stone res- 
ervoirs below, where it can 
be captured and stored. 

Even the roofs of buildings 
can be used for flood-protec- 
tion measures. Green roofs 
use plants and soil to capture 
and slow storm water, allow- 
ing much of it to evaporate 
before directing the filtered 
excess into the city’s drainage 
system. 

This drive towards newer, 
cleaner infrastructure to 
control flooding is not just 
a US phenomenon. Faced 
with drainage infrastruc- 
ture that has not kept pace 
with rapid urbanization, 
the Chinese government 
approved the creation of 
16 ‘sponge cities’ in Sep- 
tember 2015. These include 
parts of Chongqing, a city 
of nearly 10 million people. 
China hopes to use many of 
the same technologies being 

developed in California and 
Philadelphia, to quickly drain, 
store and recycle storm water. 

Replacing and augmenting outdated 
sewerage systems and improving a city’s resil- 
ience to flooding does not come cheap — China 
will spend about 400 million renminbi ($60 
million) on each of its sponge cities per year 
for the next 3 years, and Philadelphia's pro- 
gramme will need about $2 billion to fund its 
entire 25-year programme. The price is worth 
it though, says Crockett. If sea levels rise and 
the city fails to adapt, it is predicted that annual 
flood losses could rise from $89 million in 2005 
to more than $1 billion by 2050. 

Through collaborations such as the C40 
Cities Climate Leadership Group, a network 
of more than 80 cities around the world that 
together represent more than halfa billion peo- 
ple, urban areas have led the way in taking steps 
to minimize and adapt to climate change. With 
half the population worldwide in the care of cit- 
ies, how they adapt to a shifting climate will be 
crucial to future global health. As seas rise and 
weather patterns become more unstable, more 
cities may look to the experiments of Sonoma, 
Philadelphia and Chongqing for answers. m 


3) Some water 
is absorbed by 
the earth. The 
excess can be 
slowly released 
into the sewer. 


James M. Gaines is a science writer in Seattle, 
Washington. 
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The woodlands of Rock Creek Park in the centre of Washington DC. 


A natural high 


Exposure to nature makes people happy and could cut mental-health inequalities between 


the rich and poor. 


BY NATASHA GILBERT 


utumn leaves crunch underfoot 
A: I stroll down a path in Rock 

Creek Park, an urban woodland in 
Washington DC. I take a deep breath and 
feel my mood lift. Driving through the 
busy urban streets to reach the park, I had 
been worrying about work and what I was 
going to cook for my family for dinner that 
evening. 

Urban green spaces have value beyond 
their beauty and environmental importance. 
Nature improves mental health — people are 
less depressed when they have better access 
to green spaces. The beneficial effect is not 
just a matter of physical exercise, although 
that is part of the picture. There is something 
about natural environments that improves 
people's well-being, says Richard Mitchell, 
an epidemiologist at Glasgow University, 
UK. Put simply, being in nature feels good. 

Researchers and policymakers are 
increasingly interested in the link between 


green spaces and mood because of the 
implications it could have for preventing and 
treating mental-health problems in society, 
says Hannah Cohen-Cline, a researcher at 
Providence Health and Services in Portland, 
Oregon. Spending time outdoors in natural 
environments not only improves people’s 
mental health, but it could also help to 
reduce health inequalities between the rich 
and the poor. “Being around nature makes 
people feel better mentally. This has impor- 
tant policy implications,” says Cohen-Cline. 

Poor mental health is one of the big- 
gest public-health problems in Western 
nations. For instance, the Organisation for 
Economic Co-operation and Development 
calculates that mental-health conditions 
such as depression cost the United Kingdom 
£70 billion (US$100 billion) annually in 
health-care spending and lost productivity. 
And the epicentre of these problems is cit- 
ies: a 2010 meta-analysis showed that urban 
dwellers are roughly 20% more likely to 
develop anxiety disorders than their rural 
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counterparts, and nearly 40% more likely to 
develop mood disorders’. 


DIGGING FOR EVIDENCE 
Improving access to green space — such as 
parks or gardens incorporated into housing 
developments — in cities could help to cut 
urban stress, improve city dwellers’ mental 
health and reduce the strain on health-care 
systems. But until recently, most studies that 
showed a link between green space and mental 
health were small, short term and involved 
groups of similar people, such as students. 
“These studies have major limitations,” 
says Mathew White, a social and environ- 
mental psychologist at the University of 
Exeter, UK. It’s not clear whether the results 
are applicable to wider populations or that 
the beneficial effects persist over time, he 
says. This is problematic for policymakers 
who want to see the benefits before invest- 
ing in health and social interventions. 
Scientists are working to tackle these 
limitations and strengthen the evidence base. 


JIM LO SCALZO/EPA/CORBIS 
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White and his colleagues were the first to 
study changes in mental health over several 
years as people moved within urban settings. 
They found that when people moved to areas 
with more green space, including tree-lined 
streets, private gardens and public parks, 
they were happier for at least three years after 
their move, and that this feeling of content- 
ment grew over time’. 

The research ranked movers’ well-being 
using the short-form General Health Ques- 
tionnaire (GHQ; a standard clinical tool for 
measuring anxiety and depression on a scale 
of 0 to 12). White used an inverse of the GHQ 
scale so that higher scores represented bet- 
ter mental health. The findings showed that 
when people moved to areas with more green 
space, their average GHQ score rose from 9.8 
two years before their move to 10.1 three years 
after their move. 

The durability of the happiness effect 
surprised White. He expected the boost to be 
short-lived because “people adapt to things 
quickly’ Winning the lottery, for instance, typi- 
cally makes people happier for up to one year, 
he says. The benefits of moving to greener areas 
may last even longer than 3 years (the team only 
looked at 5 years of data in total) — White is 
planning a larger study to find out. 

White acknowledges that, despite lasting 
longer than expected, the benefits to mental 
health seem small. Moving to a greener area is 
only around one-tenth as important for peoples 
happiness as becoming employed, and has one- 
third of the impact that marriage does, he says. 
But, White points out, green space has a greater 
effect on happiness than low crime rate, which 
is often cited as a key determinant of well-being. 
And if the small effect ofa green space is mul- 
tiplied by the thousands of people who use it, 
that adds up to “a large public-health impact,” 
he says. 

Until recently, most studies had been unable 
to control for the genetic variation that sees 
some people respond more positively to green 
space, so it has been difficult to definitively say 
whether the benefits are due to the green space 
or to a person’s genetic makeup. But Cohen- 
Cline has unpicked the drivers of mental health 
using twins. Because twins share at least half 
of their genes, and those who took part in the 
study were raised in the same environment, the 
researchers were able to control both the genetic 
and environmental factors’. “This is important 
because we know that genetics and childhood 
environments playa key role in the risk of devel- 
oping mental-health issues,” says Cohen-Cline. 

The authors found that green spaces have a 
direct mental-health impact. People with better 
access to green space had slightly fewer depres- 
sive symptoms than those in less green areas. 
Independent of any potentially confounding 
factors, such as childhood environment and 
genetics, “there is something about green space 
itself that benefits people's mental health,’ says 
Cohen-Cline. Although the twin study shows 
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Green spaces improve mental and physical health. 


that green spaces make people happier, it does 
not say how this works. “It is doing it through 
several different pathways, and we are still try- 
ing to tease that apart,” she says. Exercise is 
known to improve mood, but Cohen-Cline’s 
study found no evidence that it substantially 
changed people's depressive score, “suggesting 
that it is not what is driving the association” 

One route could be that parks allow people to 
socialize, which in turn improves their mood. 
“Social ties are very strongly associated with 
mental health,” says Cohen-Cline. Mitchell is 
putting his money on another route — peo- 
ple’s perception of nature causes physiological 
changes, such as reducing the stress hormone 
cortisol and lowering blood pressure. “You per- 
ceive nature with your senses,’ Mitchell says. 
“Your brain processes those sensory experi- 
ences and triggers physiological responses.” 

Evidence for why this would be is so far 
thin, but theories abound. One possibility is 
that people's brains are overexposed to stress- 
ful stimuli such as noise and overcrowding in 
urban environments. By contrast, Mitchell says, 
natural environments give the brain an oppor- 
tunity to recover from mental fatigue. It’s also 
possible that our evolutionary heritage means 
we are simply hard-wired to respond positively 
to the green spaces that our ancestors grew up 
in. “We're faced with stressful, noisy environ- 
ments. When we encounter an environment 
that is more in keeping with our evolution, that 
we might innately perceive as more supportive, 
our bodies and minds react favourably; we lit- 
erally relax’, says Mitchell. 

Whatever the underlying explanation, 
there is evidence that green spaces elicit a 
direct physiological response, says Mitchell. 
In Japan, for example, people who spent time 
participating in Shinrin-yoku — sitting or 
walking in a forest — had lower cortisol con- 
centrations, pulse rates and blood pressure 
than when they visited the city*. And it’s not 


just parks and forests; blue spaces such as the 
sea, canals and lakes may give an even bigger 
boost to people’s mood’. 


GREEN INTERVENTION 

As the evidence grows, policymakers will be 
able to design health interventions that use 
natural resources. The therapeutic and soci- 
etal value of green spaces is already starting to 
draw attention. “Policymakers are taking on the 
message that they have a resource that might 
be good for people's health and well-being? 
says Mitchell. The £8.9-million restoration of 
Clissold Park in northeast London in 2011, for 
instance, was highlighted by the UK govern- 
ment agency Public Health England in 2014 as 
an example of a local health intervention. 

But creating the spaces isi’t enough — says 
White, there are strong “psychological bar- 
riers” that prevent some people from using 
green space. Just 40% of the UK population will 
spend time near nature in any given week, and 
although a lack of time is the main reason given, 
he says, others say they don't enjoy spending 
time outside, or that it’s not part of their culture. 

To engage those most in need, White thinks 
that health services should offer people with 
depression ‘green prescriptions, which would 
encourage them to join walking groups or allot- 
ments, for example. Physicians could offer this 
before or as well as drug treatments. White is 
attempting to work out how these green pre- 
scriptions could work in practice, and the 
potential cost saving for health services. 

If participation can be improved, one area 
that may benefit the most is health inequality. 
Contentmentis not evenly distributed across the 
socio-economic spectrum: affluence is generally 
associated with greater happiness. But evidence 
is beginning to show that green spaces could 
narrow this gap. Mitchell and his colleagues 
found that access to green spaces could reduce 
inequality in mental well-being by 40%". 

“Tt is a sizable reduction. Nothing else govern- 
ments have tried has really had much impact,” 
says Mitchell. The study has its limitations — 
despite the strong association between green 
space and decreasing mental-health inequality, 
there's no proof of causation — but Mitchell is 
clear: “having a park in your neighbourhood has 
a greater benefit on poorer people.” 

“The effects are largest in poorer communities, 
says White. “Rich people are healthy already: = 


Natasha Gilbert is a science writer based in 
Washington DC. 
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Nancy Ybarra is one of the caretakers of Pogo Park — a community project in Richmond, California. 


The privilege of health 


Deprivation leads to stress, and stress to bad health. A park, and the science behind it, 


aims to break that chain. 


BY AMY MAXMEN 


hildren play on foam mats bathed in 
( late afternoon light as a few staff at Pogo 

Park hang Christmas lights. Kids squeal 
as they slide along a zip wire, and a few teen- 
age boys bounce a basketball against concrete 
painted in pastel designs. It’s hard to believe 
that this is the Iron Triangle — the most deadly 
part of Richmond, California, one of the most 
dangerous cities in the United States. 

One of the park's caretakers, Nancy Ybarra, 
now 26, played in the park as a child when it 
was a dirt patch with two tire swings. Men rolled 
dice under the trees by day, drank beer and took 
drugs at night. Ybarra remembers days when 
police tape surrounded the park after shootings. 
But the park is safe now. It looks like “a park that 
could be in a white neighbourhood,’ Ybarra 
says. As we walk along the path that encircles it, 
Ybarra describes how the park is more than just 
well designed. It was built, and is now staffed, by 
people from the neighbourhood, and it serves a 
variety of functions, ranging from employment 
and childcare to empowerment. 

These might seem like social issues, but the 


park has been justified as a physical-health 
intervention. In Richmond, as in poor cities 
around the world, high rates of heart disease, 
asthma, diabetes and infant mortality corre- 
late with certain social and economic traits. In 
2010, unemployment in Richmond was 20%, 
38% of the children were living in poverty and 
the city had the sixth highest crime rate in the 
United States. 

Statistics such as these correlate with negative 
health outcomes in cities around the world. 
More than 25% of Richmond’s children have 
asthma, compared with a Californian average 
of 15%. About 40% of the city’s children (and 
62% of adults) are overweight or obese, com- 
pared with the state's average of 30% for kids. 
A child born in Richmond is expected to die 
11 years earlier than one in the richer city of 
San Francisco just across the bay. For these rea- 
sons, says Jason Corburn, an urban planner and 
epidemiologist at the University of California, 
Berkeley, “zip code matters as much as genetic 
code in understanding a person's susceptibility 
to disease and premature mortality” 

Corburn focuses on how to improve 
the health of city residents. He argues that 
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cities must transform their physical and social 
environments — everything from their parks 
and housing, to the way that community mem- 
bers participate in their city’s governance — to 
improve the health of citizens. Richmond is a 
short drive from Corburn’s office at Berkeley, 
and the city council has been receptive to his 
ideas. One of the council’s earliest moves was 
to support local entrepreneur Toody Maher to 
build Pogo Park. And in 2014, the city enacted 
a ‘Health in All Policies’ ordinance. This policy 
strategy required all municipal decisions to take 
into account both the physical aspects of health, 
such as safety and food availability, and social- 
health issues such as stress. 


TOXIC STRESS 

For the past 20 years, researchers have worked 
to solidify the links between chronic psycho- 
logical stress and physical health. In 1996, Steve 
Cole, a genomics researcher at the University 
of California, Los Angeles, noticed that HIV 
progressed faster in gay men who concealed 
their homosexuality than in men who were 
open about it'. Cole and his colleagues found 
that men who concealed their sexuality were 
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more psychologically sensitive to social threats, 
and that stress correlated with increased activ- 
ity of the sympathetic nervous system (SNS). 
The basis of the body’s fight-or-flight response, 
the SNS kicks into action by releasing adrena- 
line and noradrenaline, priming the body for 
physical activity. Over the next decade, Cole and 
others described how SNS activation also shuts 
down proteins involved in fighting viruses, 
called interferons, allowing HIV to replicate 
without impediment. 

The finding tempted Cole and Greg Miller, 
a psychologist at Northwestern University in 
Evanston, Illinois, to explore the connections 
between stress and other diseases besides HIV. 
They were particularly interested in subtle, 
chronic stress, as opposed to the panicky reac- 
tion triggered by, say, a near-death accident. 

Epidemiologists knew that asthma was 
disproportionately common in children from 
poor urban communities, even when fam- 
ily history and air quality were controlled for. 
To find out whether stress played a part in the 
asthma prevalence, Miller and his colleagues 
asked children about their family relationships, 
friends, school life, home life and neighbour- 
hood. Children with asthma from poorer 
backgrounds, and in more stressful situations, 
tended to have higher counts of white blood 
cells called eosinophils”. Given the right aller- 
gic trigger, eosinophils kick off the production 
of molecules that constrict the airways as well 
as mucus production, resulting in shortness 
of breath. And eosinophil recruitment, the 
researchers found, follows SNS activation and 
the subsequent production of the inflammatory 
molecule interleukin-5. 

Biological connections to chronic stress have 
also been found in conditions such as cardiovas- 
cular disease and type 2 diabetes. People from 
low socioeconomic communities regularly 
show signs of chronic stress — levels of stress- 
related hormones such as cortisol are often 
higher, for instance. Cole 
suspects that the chronic 


“We don’t 
stress associated with pov- ° 
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economic resources, you 
might develop a blind faith that you can figure 
out problems,” he explains. “But without those 
resources, there’ a baseline of uncertainty.” 

Stress can stem from a lack of control. In 
post-industrial cities in Finland, researchers 
have suggested that shame and low self-esteem 
arising from high rates of unemployment 
account for the heavy burden of health issues in 
these areas compared to the country’s wealthier 
cities. For Finnish men who were unemployed 
twice over a 3-year period, the small risk of 
death in the 3 years that followed was 168% 
greater compared with men who had been 
unemployed only once’. 

Psychological stress among the urban poor 
is distinct from that of those who live in rural 
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Pogo Park could help to improve physical health. 


areas, perhaps because income disparity is not 
as obvious. “Cities expose you to inequality. 
You see what apartments in rich neighbour- 
hoods look like, which you cannot afford, you 
know how much food costs at restaurants in 
those places,” says Eldar Shafir, a behavioural 
scientist at Princeton University in New Jersey. 
“Well-being is heavily impacted by compari- 
sons,’ Sharif adds. “It impacts your evaluation 
of self-worth and self-identity” 


REDUCING THE BURDEN 

In poor areas of Richmond, there is no shortage 
of sources of stress. Violent crime makesa stroll 
daunting. Poverty means that people worry 
about where their next meal will come from. 
Undocumented citizens are on alert because 
of their precarious status. Institutional racism 
can lead to feelings of inadequacy and self- 
defeat. With stressors such as these at play, 
Richmond’s policymakers have realized that 
medical approaches, such as access to asthma 
medication, alone will not improve the health 
of Richmond. Gabino Arredondo, Richmond’s 
Health and Wellness Coordinator says, “when 
it comes to health, people usually think about 
doctors and clinics, but we're focusing on 
upstream interventions.” 

To understand what stresses residents, 
Corburn, Arredondo and their team combed 
through surveys of about 4,000 residents col- 
lected since 2007. They also spoke with com- 
munity-based organizations and analysed 
geographical data. Areas where violence occurs, 
car accidents happen and supermarkets are 
rare were associated with stress. A wide vari- 
ety of interventions were needed to address the 
many issues at play. As a result, Richmond has 
prioritized measures suchas speed bumps, lead 
removal in houses, training to address implicit 
bias for police officers, and events that help resi- 
dents to enrol in health insurance. 


It may take a decade for Richmond’s 
comprehensive approach to have a demonstra- 
ble effect on disease — particularly for condi- 
tions such as cardiovascular disease that take 
years to develop. However, Arredondo argues 
that if policymakers hold out for this type of 
evidence before taking action, the biomedical 
repercussions of letting another generation 
grow up in stressful conditions will be costly. 
“We dont have time to wait,” Arredondo says. 
“This is urgent.” So the city is plunging ahead. 

As it proceeds, the team keeps track of 
quantitative indicators, such as the rate of 
asthma in children, as well as survey data 
that assess residents’ opinions over time. The 
results have provided some glimmers of hope. 
For example, Corburn says that according 
to the 2015 surveys, people who identified 
themselves as belonging to a non-white group 
and those of a low socioeconomic status 
reported better perceptions of safety, greater 
economic and recreational opportunities, 
and felt more included in the city compared 
to 2009. He has shared these results with the 
World Health Organization, and hopes that 
Richmond’s approach will provide insight 
that policymakers around the world can use 
to improve the health of the urban poor. 

Maher initially pitched her dream of Pogo 
Park to members of Richmond’s community 
and to employees of the city, such as Arredondo. 
Collectively, they successfully made the case 
to the state of California that funding the park 
could help to ameliorate the isolation, low self- 
esteem and uncertainty that diminishes the 
health of Iron Triangle residents. Unlike previ- 
ous Richmond green-space projects, which had 
fallen into disarray, Pogo Park was designed and 
constructed by residents. Today, they watch over 
it. Eddie Doss, a 59-year old who lives across 
the street, keeps an eye on the park at night: “If 
people come here to drink beer, I tell them no, 
that’s against the rules, and they say, OK Eddie.” 

This year, Richmond secured a US$6.2 mil- 
lion grant from the state to build just over 
3 kilometres of safe streets that connect Pogo 
Park to another park in the Iron Triangle. 
Pogo Park’s success helped Richmond's proposal 
to beat those from competitors in other cities. 
Residents who work and volunteer at the park 
could not be happier with the win — although 
they, more than anyone, understand thatit’s just 
one of many components needed to reduce their 
burdens. “This park will not solve violence,’ says 
Ybarra. But, she adds, “I hope that the kids I take 
care of here will not become that shooter, they 
will not become that dope dealer. I’m watching 
them grow up well.” m 


Amy Maxmen is a freelance science writer 
based in Berkeley, California. 
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increasing numbers. As towns and cities grow, farmland is 

being concreted over. Urban agriculture is often considered 
to be the future, at least for some components of a city dweller’s 
diet. But cities are far from prime agricultural land, and it cannot 
be assumed that food produced in urban areas is safe to eat. 

Growing food in urban environments seems like an attractive 
proposition, partly because of the inherent sustainability: waste 
heat that all cities generate can be harnessed, and grey water (waste 
water from baths, showers and kitchen appliances) or surface run- 
off and nutrient-rich sewage effluent can be recycled. As well as a 
sustainable use of brownfield sites, it can reduce the carbon foot- 
print of food transport and can make cities greener. For poorer 
families, urban farming produces an income and can diversify 
diet. Community-based projects can promote social interaction 
and outdoor activity for a double dose of health benefits. Areas 
designed with urban farming in mind — such as vertical farms 
(growing plants up the sides of buildings, for 
example) and patchworks of fields between, on 
top of or within blocks of buildings — could 
shape our future cities. Rooftop and indoor 
farming would further increase the land area 
available for agriculture. 

But before urban farming can be expanded 
on a large scale, a key issue that differentiates 
cities from the countryside must be consid- 
ered: pollution. Current and past industrial 
activity, dense and dirty transport infrastruc- 
ture, domestic burning of fossil fuels and the 
plethora of chemicals released into domes- 
tic waste streams all pollute the soil of cities. 
Urban sewage and grey water carry detergents 
and excreted drugs — concerning because 
many of these are hormonally active — as 
well as a large suite of industry-derived con- 
taminants. In addition to contaminants from 
suspended soil dust, city air already has elevated levels of nitro- 
gen oxides, sulfur oxides, hydrocarbons and particulates from car 
exhausts. Air pollution is known to reduce urban crop yields’, but 
the consequences of ingesting foodstuffs covered in these pollutants 
are not well understood. 

Urban soil pollution is not uniform across a city — some areas are 
severely contaminated, whereas others are cleaner and more suitable 
for farming. But, in general, fruit and vegetables produced in city 
environments contain more undesirable substances than rural pro- 
duce. Whether this increased pollutant burden constitutes a health 
risk is a matter of debate”. However, to ensure that commercially pro- 
duced food from urban farming is safe, systematic monitoring must 
be in place. Each potential location should be screened for contami- 
nants and each variety of produce grown must be analysed, because 
of the differences in the accumulation of contaminants. For exam- 
ple, an extensive survey of fruit and vegetables in a historic mining 
region of southwest England showed that soil-dust-contaminated 
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City farming needs monitoring 


Pollution poses a significant challenge to food production in urban 
environments, says Andrew A. Meharg. 


leafy vegetables and tuber crops contaminated by direct soil contact 
were important sources of toxic metals in commercial horticultural 
produce’. Monitoring of this kind is intensive, but the consequences 
of unsafe produce entering the food chain impel us to implement 
these measures for crops grown in cities. 

Pollution in urban farming flows in both directions. Just as the 
city can contaminate agricultural produce, farming itself can intro- 
duce unwanted chemicals into the environment. Water supplies 
can be polluted by inorganic fertilizers and manures, which lead 
to excessive build-up of algae and aquatic plants in nutrient-rich 
waters, as well as by pesticides. Noxious smells, excessive noise and 
the stirring up of soil dust into the atmosphere while working the 
land are all by-products. And having farmland in close proxim- 
ity to the public is problematic, particularly for infants who may 
ingest the soil. 

One way to allow intensive agricultural activity in urban centres, 
but to avoid potentially contaminated soil, is hydroponics — 
growing plants without soil, in water. How- 
ever, as many cities struggle to supply water 
for domestic and industrial use, large-scale 
hydroponic farms would be a further burden 
on a precious resource. Although in principle 
hydroponic water could be recycled, this is 
not always feasible, owing to the water’s high 
nutrient and, if used, pesticide content. In hot 
climes, bringing water for agriculture into the 
centre of cities can lead to increases in pests 
such as malaria-carrying mosquitoes’. 

It may be possible to build new cities that 
avoid the current contamination issues. 
But in existing cities, where urban farming 
is an afterthought, some lateral think- 
ing is required to give urban agriculture a 
future. Growing non-food crops such as 
textile fibre plants, biomass crops and tim- 
ber would make use of urban and suburban 
waste land, green the city, recycle waste water and biosolids, 
and produce crops that currently take up rural land that is ideal 
for food production. Whether farming in cities is cost-effective 
or not, the non-economic returns such as better living spaces 
that facilitate social interaction through community-based activ- 
ity must be considered. If urban waste resources can be recycled 
into energy, building materials and clothes through farming, 
everyone will benefit, ultimately making our cities healthier 
environments. m 


Andrew A. Meharg is a plant and soil scientist at Queen's University 
Belfast, UK. 
e-mail: aa.meharg@qub.ac.uk 
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Madina market in Conakry, Guinea. Densely populated urban environments are ideal for the spread of infection. 


Poverty and pathogens 


The growth of slums in the developing world’s rapidly expanding cities is creating new 
opportunities for infectious disease to flourish and spread. 


BY MICHAEL EISENSTEIN 


s Lee Riley read article after article about 
A« deadly 2014 Ebola outbreak, his 
frustration mounted. “I was seeing all 
of these newspaper reports and even scientific 
reports talking about this unprecedented epi- 
demic in West Africa,’ says Riley, a specialist in 
urban public health at the University of Califor- 
nia, Berkeley, “and there wasn'ta single mention 
of the words ‘slums or ‘informal settlements”” 
Ebola is feared because of its high mortality 
and limited treatment options, but generally it 
has been limited to remote rural regions. The 
2014 outbreak was different: flare-ups in cities 
such as Conakry in Guinea and Monrovia in 
Liberia revealed the havoc that this lethal virus 
could wreak in urban environments. The dense 
and highly mobile populations provided greater 
opportunities for the infection to spread. And 
according to Mosoka Fallah, an epidemiolo- 
gist who was working with Liberia's Ministry 


of Health at the front line of the Monrovia 
outbreak, urban slums bore the brunt. “Wher- 
ever there were big outbreaks, most people 
being infected were among the poor, he says. 
“Those that didn't have basic sanitation, who 
had the most distrust of institutions — they also 
had the most disease.” 

Developing nations have experienced an 
astonishing boom in urbanization in the past 
few decades. The urban population of Kenya, 
for example, has grown at an average rate of 
4.3% per year since 2010, as rural citizens 
have moved to cities in pursuit of new oppor- 
tunities. “The range is between 3% and 6% 
in most of Africa, says Robert Breiman, an 
infectious-disease epidemiologist at Emory 
Global Health Institute in Atlanta, Georgia. 
Riley says that there has been a similar trend 
in Brazil, where 85% of the population now 
lives in cities. Many migrants initially make 
their home in informal settlements at the 
city periphery. The United Nations Human 


Settlement Programme UN-Habitat estimates 
that 863 million people — one-third of the 
developing world’s urbanites — live in slums. 
Although better access to medical care means 
that the health of city dwellers across the socio- 
economic spectrum is generally superior to that 
of their rural counterparts, cities can also pro- 
vide greater opportunities for infectious diseases 
to flourish. Crowding and poor or non-existent 
infrastructure exacerbate the risk of infectious 
disease to slum inhabitants in particular. How- 
ever, as demonstrated by the rapid spread of the 
Zika virus over the past year, outbreaks can bea 
threat to entire cities, nations and — thanks to 
globalization — the rest of the world. 


CLOSE QUARTERS 

Breiman has worked extensively in Kibera, 
one of the oldest and most established slums 
in Nairobi. Some attempts have been made to 
assimilate this area into the capital’s infrastruc- 
ture — with mixed results. “There is piped 
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water that comes in, but it is tapped into by local 
water sellers,’ he says. The damage they cause 
to the pipes makes it easy for contamination 
to occur. “There’s a lot of running sewage, and 
oftentimes it mixes with the corrupted water 
pipes.” Less established slums have no piped 
water or waste disposal at all. 

These conditions are ideal for the spread of 
diseases such as cholera, a bacterial diarrhoeal 
infection transmitted through contaminated 
food and water. “We still have lots of epidem- 
ics of cholera in Africa, 
and these are typically 
associated with sanita- 
tion,” says Amadou Sall, 
an infectious-disease 
researcher at the Pasteur 
Institute in Dakar, Sen- 
egal. Typhoid infection 
occurs through a similar 
route. Children in Kenya's 
informal settlements have 
a one in five chance of 
contracting typhoid by 
age ten, says Breiman. In 
his view, this problem is 
entirely attributable to 
poor living standards. 
“When typhoid went 
away in New York City 
and London, and else- 
where, it didn’t go away 
because of vaccines,” says Breiman. 
“Tt went away because there were 
physical changes in these environments, and 
people were living in a different manner” 

The rise of urban Ebola was a product of 
overcrowding and inadequate medical care, 
as well as close ties between people who had 
recently arrived in the city and their home vil- 
lages. The first case diagnosed in Conakry was 
contracted at a rural family funeral and only 
turned symptomatic after the person returned 
to the city. “When that patient got sick, they 
went to the hospital and initiated several infec- 
tions among the health-care workers,’ says Sall. 
Instead of going to hospitals, many other people 
in Conakry went to makeshift medical facili- 
ties in spaces that lacked the resources to pro- 
tect carers or patients, such as peoples’ homes; 
this did more to spread the infection than to 
contain it. The urban poor often travel far and 
wide to find work, greatly increasing the area 
that could be affected by the virus and making 
contact tracing — finding and monitoring peo- 
ple exposed to infected individuals — a daunt- 
ing undertaking. “In our latest outbreak, we had 
over 165 contacts that had spread across almost 
60% of the city,” says Fallah. 


COUNTRY BUG, CITY BUG 

Unlike Ebola or typhoid, many diseases 
require an intermediary vector — often an 
insect — to jump from person to person. For 
these pathogens, becoming a permanent fea- 
ture of the urban environment is a protracted 


A warning of Chagas 
disease transmission 


process because the vector must first establish 
a foothold there. 

An abrupt shift from an agricultural lifestyle 
to city living can facilitate this process. Chagas 
disease, a parasitic infection spread by insects 
called triatomines, has long plagued rural parts 
of Latin America. But over the past few dec- 
ades, it has begun to show up in cities as well. 
The insects accompany rural migrants — and 
the parasite-infected animals they bring with 
them — to mid-altitude cities in the Americas, 

such as Arequipa in Peru. 


a = Sy Here, the primary vehi- 


cles for the disease are 

guinea pigs — a popular 

staple of the Peruvian 

diet. The poorest immi- 

grants typically congre- 

gate in makeshift, illegal 
settlements known as 
invasiones. These poorly- 
constructed shanty 
towns have the makings 
of an ideal home for tri- 
atomines, which like to 
dwell in wall cracks. But 
opportunities to feed are 
greater in areas that are 
more densely populated 
than the relatively spartan 
invasiones. Because of this, 
it is the equally poor, but more 
established areas of the city nearby 
— those that have made the jump 
from illegal camps to legitimate urban dwelling 
— that face the worst infestations. Arequipa’s 
invasiones are merely the entry point for the 
insects. “Once people get their land titles, there's 
this infusion of building materials, dogs, guinea 
pigs and other animals, and more people — 
that’s the tinderbox that allows the bug to take 
off? says Michael Levy, an epidemiologist at the 
University of Pennsylvania in Philadelphia, who 
has worked in Arequipa for more than a decade. 
Further improvements to living conditions — 
less crowding, fewer animals and higher-quality 
homes — seems to be a deterrent to the insects, 
says Levy. 

Whereas triatomines are restricted to 
certain parts of the Americas, the mosquito 
Aedes aegypti has become entrenched in cities 
throughout the tropics, largely driven by Asian 
and African industrialization. This mosquito, 
which transmits the viruses responsible for den- 
gue and yellow fever, normally breeds in tiny 
pools of water such as those found in holes in 
trees. But the rise of cars and plastics has given 
A. aegypti appealing new options. “Tires that 
wear out would get discarded in the environ- 
ment, and they make ideal larval habitats,” says 
Duane Gubler, a specialist in tropical medicine 
at Duke-NUS Medical School in Singapore. 
The insects also thrive in the water receptacles 
used by households without ready access to 
municipal services. 

For one major vector-borne disease, 
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urbanization actually provides some protection. 
The Anopheles mosquitoes that transmit 
malaria are much more particular about where 
they raise their young than Aedes. “They like 
clean, sunlit pools of water with vegetation 
around them, which are not typically found in 
urban areas,” says ecologist and epidemiologist 
Andrew Tatem at the University of Southamp- 
ton, UK. By combining detailed satellite data on 
urbanization in Uganda with household surveys 
for mosquitoes and data on malaria incidence, 
Tatem and his colleagues have observed a pat- 
tern (S. P. Kigozi et al. Malaria J. 14, 374; 2015). 
“It seems pretty clear that if you move to an 
urban area, youre at less risk of getting malaria,” 
he says. There are some exceptions: gardens at 
the city edge can offer breeding grounds, and 
a species of malaria-transmitting mosquito 
(Anopheles stephensi) found in India can pro- 
create in small water vessels much like Aedes 
can. “But in the concrete jungle, urban centre, 
you'll generally get zero breeding,” says Mark 
Wilson, an ecologist and epidemiologist at the 
University of Michigan in Ann Arbor. 


NEIGHBOURHOOD WATCH 

The 2014 Ebola outbreak was a trial by fire, 
testing countries’ capacity to deal with infec- 
tious disease. There was a stark divide between 
West African countries that promptly brought 
the epidemic under control and those that did 
not. Preparedness was a crucial distinguish- 
ing element, according to Breiman. Nigeria in 
particular benefited from a robust infrastruc- 
ture for polio eradication, which was rapidly 
converted into a strategy for eliminating Ebola 
— the country was free of the disease after 
3 months and 19 locally transmitted cases. 

Such infrastructure is generally most 
developed in cities. Better community surveil- 
lance and prompt delivery of medical care are 
key advantages for battling the spread of infec- 
tion. But even a well-designed rapid response 
can falter in the slums. Conakry established 
a dedicated Ebola-treatment centre within a 
month of the first reported case, but Sall and 
his colleagues encountered numerous hurdles 
among disenfranchised residents of the city’s 
poorer districts. “People there tend to mistrust 
the government and the health system,” he says. 
People kept their illness secret because they 
were afraid of being isolated from their fami- 
lies, and saw no value in reporting to a hospital 
if no cure was available. “Some people didn't 
want to go because they were worried about 
organ smuggling,’ says Sall. Fallah says that the 
situation was similar in Monrovia, with families 
performing secret — and unsafe — burials of 
their dead. 

Grass-roots efforts made the crucial differ- 
ence. The initial outbreak in Monrovia raged for 
months, but Fallah and his team regained con- 
trol by conducting public meetings to discuss 
fears and concerns. “Many people brought very 
important suggestions, and this also brought 
us in touch with influential members of the 
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community,’ he says. These connections helped 
track down slum-dwelling contacts who would 
otherwise have been missed. It also allowed 
the formulation of public-health practices that 
were compatible with the community’s ethnic 
customs. “By getting engaged with religious 
leaders,” says Fallah, “we developed methods to 
make burials safe while keeping them dignified” 


VANQUISHING VECTORS 

Community-centred strategies have not always 
been enough. Eliminating Aedes mosquitoes, 
for example, has often demanded aggressive 
government intervention. Beginning in the 
1930s, US epidemiologist Fred Soper waged 
a successful war on dengue and yellow fever 
across Latin America — an effort facilitated by 
the dictatorial governments in power through- 
out the region. “The programme had a para- 
military type of structure,’ says Gubler. Soper 
and his team were given free rein to check every 
building in a city for mosquito habitats. In simi- 
larly heavy-handed fashion, Chile eliminated 
the Chagas vectors triatomines in its cities under 
the brutal regime of Augusto Pinochet with a 
door-to-door campaign overseen by the mili- 
tary police. “Insecticide sprayers would go to 
each house, and the carabineros would be with 
them,’ Levy says. “They ask if youd let them 
in — and everybody said yes!” 

Successful as these strategies may have been, 
the cost to personal liberty makes them impos- 
sible to endorse. Thankfully, in Arequipa, Levy 
and his Peruvian colleagues have made con- 
siderable progress through more reasonable 
means, particularly in poorer quarters where 
awareness of Chagas disease is high. Although 
in the richer districts, which have often had the 
luxury of ignoring the 
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only 64% in richer 
neighbourhoods. In 
these communities, 
says Levy, “the campaign has gone from being a 
wonderful public-health benefit to an imposi- 
tion.” His group is exploring incentives to boost 
participation, including community-leader 
training and raffles with jackpots that reflect the 
extent of the community’s involvement. 
Arequipa’s triatomines are now largely under 
control, but vectors can quickly return if cities 
lower their guard. This is especially true of 
A. aegypti, because it procreates more rapidly 
and spreads more aggressively than triatomines. 
But convincing people to deal with the insects 
before they become a problem can be tough. As 
head of the US Centers for Disease Control and 
Prevention (CDC) dengue-control programme 
in Puerto Rico in the early 1980s, Gubler led a 
community-based effort to train residents to 
evict the mosquitoes from their back gardens. 


sanitation.” 
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A water inspection to look for larvae of Aedes mosquitoes, which carry the Zika virus. 


The results were disappointing. “Nobody con- 
trolled the mosquitoes in their yards or houses 
until there was an epidemic,” he recalls. Ulti- 
mately, a multipronged attack may be the only 
real solution — thorough habitat eradication, 
sensitive diagnostics, access to durable insecti- 
cides and, ideally, a vaccine. 


A GLOBAL DIAGNOSIS 

Although slums provide fertile ground for 
infectious disease, the problem seldom remains 
local. Various species of Aedes mosquitoes 
have spread through the trade routes that link 
the world’s cities. China's investment in urban 
development in Africa, for instance, has brought 
the Asian tiger mosquito Aedes albopictus to the 
continent. And once the Aedes mosquito makes 
itself at home, a sufficiently virulent strain of 
dengue can quickly cause a serious outbreak. 

Outbreaks of dengue are facilitated by 
globalization. Singapore’s anti-Aedes pro- 
grammes, which were bolstered after a period 
of relaxation during the 1980s, have proved 
inadequate against introduction of the virus by 
visitors — including large numbers of migrant 
workers — from endemic regions in south and 
southeast Asia. A similar process allowed the 
Aedes-borne Zika virus, which was once con- 
fined to Asia and Africa, to establish itself in 
Brazil — one theory suggests that it may have 
arrived with the influx of tourists for the 2014 
football World Cup. The virus has rapidly 
expanded its reach — the World Health Organi- 
zation (WHO) is anticipating as many as 4 mil- 
lion cases throughout the Americas. “Zika virus 
has been around for many decades,’ says Riley, 
“but it wasn’t until it entered massively urban- 
ized Brazil that it spread like wildfire” 

To prevent such propagation, the global 
public-health community must first intervene 
to protect the populations at the epicentres of 
emerging outbreaks. The Ebola epidemic served 


as an important call to arms. In Liberia, for 
example, what began asa crisis has turned into 
an opportunity to develop health-care capacity. 
“We have built up a competent corps of Libe- 
rians who understand the disease well,’ says 
Fallah. “We now have ambulances and contact- 
tracers and lots of partners like CDC, WHO and 
Unicef working with us.’ Such efforts could dra- 
matically reduce the severity of future outbreaks. 

Nevertheless, the fundamental problem of 
the slums remains. “If we're going to prevent 
major epidemics,’ says Gubler, “we need bet- 
ter living standards — better housing, reliable 
piped water, good sewage and good hygiene in 
these tropical urban centres. That’s not going 
to happen any time soon.’ Some governments 
have dealt with their slums by simply bulldoz- 
ing them away. Others have pursued more 
sympathetic relocation programmes. 

But community is essential to quality of life 
— especially for the urban poor — as Brazil 
discovered after it transplanted slum-dwellers 
to high-rises in the 1970s. The government 
provided a tremendous amount of new infra- 
structure, but the consequences were not all 
positive. “This just displaced the social network 
that these people had,’ says Riley. This upheaval 
“engendered crime and other disruption of the 
social fabric”. Over the past decade, says Riley, 
Brazil's government has focused on improving 
the slums themselves, providing parks, child- 
care centres, electricity and drinkable water. In 
Fallah’s view, recognizing this social fabric and 
partnering with the community leaders who 
bind it together offers the most cost-effective 
way to protect urban public health. “You have 
to have a consistent presence in the commu- 
nity, and not just a sporadic presence whenever 
there is an outbreak.” m 


Michael Eisenstein is a freelance science 
writer in Philadelphia, Pennsylvania. 
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Urban physics 


Cities are complex environments. Planning interventions that borrow principles from 
theoretical physics could help to improve peoples’ lives. 


BY KEVIN POLLOCK 


he ideal city. That was what the French 
Tost in mind when they made Hanoi the 

capital of colonial Indochina at the turn 
of the twentieth century. To modernize the trop- 
ical city, French administrators, technocrats and 
engineers built railways, bridges and an opera 
house. But rather than becoming a symbol of 
success in their southeast Asian colonial project, 
Hanoi became an example of urban apartheid. 
The European quarter was renovated with wide, 
tree-lined avenues, and large, spacious villas. 
Whereas the old quarter, home to the native 
Indochinese, was a place of narrow roads and 
overcrowded buildings. 

One part of the modernization plan did 
not have the intended effect, however. The 
colonialists had built a system of more than 
15 kilometres of underground pipes that 
flushed waste away in the European quarter — 
an improvement inspired by the discovery in 
the late nineteenth century that proper sewers 
could avert cholera outbreaks by preventing 


the contamination of drinking water. The 
old quarter, by contrast, made do with a basic 
drainage system that dumped untreated waste 
into the nearby Red River. During the floods, 
which commonly occurred during the rainy 
season, the streets in the old city were filled 
with human faeces. 

But the same sewage pipes that spared the 
colonialists from cholera also served as an ideal 
breeding ground and mass transportation sys- 
tem for the rat population — harbingers of other 
serious diseases such as the bubonic plague. 
Hanoi’ wealthiest homes, equipped with indoor 
plumbing amenities such as running water 
and flushing toilets, became overrun with the 
rodents as they left their subterranean homes in 
search of food. Between 1906 and 1908, plague 
killed at least 263 people in colonial Hanoi. 
Bubonic plague remained a risk through- 
out French colonial rule and the rat problem, 
although effectively managed, persists today. 

By trying to solve one threat to urban health, 
the French inadvertently introduced another. 
This pattern has recurred throughout the 
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history of modern cities, as urban policies and 
interventions perpetually yield unpredicted, 
and sometimes dire, consequences. This is 
typical ofa network as physically and socially 
complex as a city. Implementing policies to fix 
what's broken without introducing a new set 
of problems is in many ways the central chal- 
lenge of urban improvement. A way of think- 
ing borrowed from theoretical physics may be 
the best chance that urban planners have of 
overcoming it. 


COMPLEX MODELS 
Cities are complicated. They comprise large 
numbers of people, and the many ecologi- 
cal, cultural, social and economic entities that 
make up their environment. All these factors 
interact in time and space to form complex 
systems that constantly evolve in response to 
changes in climate, environment and people. 
The characterization of cities as complex 
systems is based on work that originated in an 
entirely different field. Physicist and Nobel lau- 
reate Philip W. Anderson proposed complexity 


JAN KALLWEJT 


theory in his 1972 article ‘More Is Different’ in 
an effort to understand the “shift from quanti- 
tative to qualitative differentiation” in his dis- 
cipline of many-body physics (P. W. Anderson 
Science 177, 393-396; 1972). The behaviour of 
complex aggregates of elementary particles such 
as the atomic nucleus could not be understood 
solely in terms of the properties of their individ- 
ual components. Instead, Anderson reasoned, 
at each level of complexity entirely new prop- 
erties appear, and each level of understanding 
requires a new conceptual structure. An atom 
in isolation can be thought of as “a featureless, 
symmetrical little ball’, he wrote, but the pres- 
ence of other particles and environmental con- 
ditions changes the shape and reactivity of that 
atom. Failure to take these changes into account 
renders the understanding of any measurable 
observations difficult, if not impossible. 

The same is true of cities. As one expands 
from the perspective of an individual urban 
dweller to the neighbourhood or town level, new 
interactions emerge and the complexity of the 
network increases. Since its inception, Ander- 
son's theory of complex systems has spread 
beyond particle physics to a variety of other dis- 
ciplines, including biology and economics. 

One of those applying systems thinking to 
urban environments is Luis Bettencourt, a com- 
plex systems specialist at the Santa Fe Institute 
in New Mexico. Like Anderson, Bettencourt 
trained as a theoretical physicist — he used 
empirical data and statistics to model the early 
Universe. But his fascination with cities and 
complex social organizations brought him to 
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the institute's Cities, Scaling and Sustainability 
project, an interdisciplinary effort to develop a 
general theory of urbanization that is capable of 
describing cities quantitatively. 

“We create models of a city at the systems 
level, he explains, “in order to describe the 
physical and social networks of people and busi- 
nesses, and understand how those mesh with 
urban space and infrastructure. Just imagine the 
math of that?” 

By analysing as much data about cities as 
possible — everything from crime statistics to 
efficiency of public transport — Bettencourt 
and his colleagues have been able to identify 
key variables and interactions to formulate a 
picture of an urban system. These models have 
allowed the team to demonstrate that many of 
the factors affecting urban well-being are tied, 
in remarkably predictable ways, to the size of 
a city; as population grows, so too does crime, 
for instance. Modelling is also useful in under- 
standing how well-meant health interventions 
in cities can have unforeseen effects. Although 
models of complex urban systems can’t pro- 
vide planners with specific policies to enact to 
improve health and well-being, they can help to 
explain one of the trickiest properties of com- 
plex systems: feedback. 


LOOPED THINKING 

Complex systems such as cities are alive with 
feedback loops. The effects of an intervention 
in one area can produce changes in another, 
which can, in turn, either amplify or oppose 
the original intervention. The simplest exam- 
ple of a feedback loop is a thermostat: as the 
temperature drops below the target level, 
the heating clicks into life and warms up the 
room. The increased temperature then feeds 
back, and the heating is switched off. 

In the case of the thermostat, the loop is 
by design. But in cities, unexpected loops 
abound. Once again, Hanoi provides a classic 
example. Faced with the rat problem, in 1902 
French colonialists began to pay rat catch- 
ers to reduce the population. Wages were 
dependent on the number of severed tails 
presented to the authorities. But the rodent 
bounty did not work as planned — some 
made the most of the financial opportunity 
and began breeding rats for their tails. 

Feedback loops can also be seen in cities 
that expand their road networks to tackle traf- 
fic congestion. Although the additional road 
capacity does initially reduce congestion, over 
time the effect is usually to entice more peo- 
ple to drive, and consequently to bring about 
more traffic jams, a higher risk of accidents 
and increased vehicle emissions. Such a loop 
cannot be described in isolation, however. 
The effects of the intervention can be seen 
elsewhere. Where roads are built can affect 
decisions about housing and energy, and the 
time spent commuting in traffic can reduce 
the time spent exercising, increasing the level 
of obesity (see ‘Making connections’). 


As is the case with any public expenditure, 
most would agree that urban policy should 
ideally be evidence based. But with seemingly 
endless loops of cause and effect to negotiate, 
determining whether an urban policy has been 
a success or not can be tricky. 

There are many different methods used by 
the public, private and social sectors of soci- 
ety to evaluate policy impact “which must 

be adapted to each 


“We’re looking specific case and each 
at issues of specific problem,” 
health today, says Ernesto Amaral, 
such as those a sociologist at the 
related to RAND Corporation in 
dependence Santa Monica, Califor- 
on motor nia. When measuring 


the impact of a policy, 
he says, the ideal sce- 
nario is to compare a group that receives the 
policy with a ‘control group that doesn't. It is 
a simple enough concept, but as the Hanoi- 
sewer story illustrates, even in cases in whicha 
control group is available, policies do not have 
binary outcomes — a single policy can have 
multiple effects, good and bad. 

Another problem is how to demonstrate 
that a policy works when the outcomes might 
take decades or longer to appear. There can 
be multiple outcomes and the policymak- 
ers responsible for the initial intervention 
may have moved on by the time those out- 
comes occur. “It gets complicated to corre- 
late back to the specific actions that actually 
caused them,” says Barry Newell, a systems 
dynamicist at the United Nations University 
International Institute for Global Health 
(UNU-IIGH) in Kuala Lumpur, Malaysia. 

When faced with having to evaluate such 
long-term effects, historical studies may prove 
crucial. “We can use history to understand 
what has happened as the result of actions in 
the past, and then project that into the future,” 
says Newell. An example of this type of research 
isa pilot project run by historian Katrina Proust 
in George Town, a city on the Malaysian island 
of Penang. Proust will lead an interdisciplinary 
team from the UNU-IGH, including Newell. 
The researchers will apply the concepts of 
systems thinking and feedback dynamics 
to an analysis of almost 200 years of histori- 
cal archive data — from 1786, when the city 
was established, to 1985, when the first bridge 
connecting Penang island with the mainland 
was completed. Proust hopes to model policy 
and decision-making behaviour by combin- 
ing numerical data with well-documented, 
qualitative observations. The goal is to better 
understand the relationship between urban 
administration, planning and public health. 
“We're looking at issues of health today, such as 
those related to dependence on motor vehicles,” 
says Proust, “as well as traditional urban prob- 
lems of the tropics like sanitation and water 
quality,’ In general, these types of small-scale, 
locally focused projects detect feedback effects 


vehicles.” 
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MAKING CONNECTIONS 


Causal-loop diagrams illustrate the interactions between variables 
within a complex human urban system. This diagram shows how 


Urban sprawl 


Road 
capacity 


feedback loops related to commuting, exercise, stress and diet may 


interact to influence the level of obesity within a city. The links 
between variables can be positive (yellow) or negative (blue). 


Where there is a positive link, any change to the first variable will 
see the next one change in the same direction. For example, an 
increase in road capacity will increase urban sprawl, and, likewise, 
a decrease in capacity will decrease sprawl. Negative links cause 


the second variable to move in the opposite direction to the first. 
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more quickly than large-scale global studies, 
because response times are relatively short and 
the people and variables within the system are 
better identified. 


A COMMON LANGUAGE 

By identifying the interactions within urban 
systems, it is hoped that real-world policy can 
be changed to improve the health of cities. 
“We want to scientifically understand and 
model complex systems in cities, to evalu- 
ate what the multiple components are and 
how they interact, in order to predict health 
and well-being in urban environments,’ says 
Franz Gatzweiler, director of the International 
Council for Science's Urban Health and Well- 
Being programme based at the Institute of 
Urban Environment in Xiamen, China. The 
programme supports research projects that use 
a systems approach to generate useful products 
for policymakers, because although projects 
such as Proust’s can improve the understand- 
ing of systems, there are few examples of this 
type of approach being applied. 

One group committed to this approach with 
some success is InWithForward, an organiza- 
tion based in Vancouver, Canada, that com- 
bines systems analysis and modelling with 
engagement of communities at the local level. 
They identify the shortcomings of existing 
urban policies by observing the system from 
the point of view of local individuals, and then 
work with these people to fill the gaps within 


the system. For example, by observing people 
with cognitive disabilities in Vancouver, the 
group discovered that although the medical 
care was adequate, most of these people had 
mundane and repetitive daily routines, and 
that their quality of life was poorer because 
of it. The organization’s solution was to con- 
nect these individuals with other members of 
their community who could provide diverse 
opportunities and learn- 


ing experiences — from “Our 
art and music teachers, capacity 
to carpenters and animal 

. « to act 
trainers. “Most of the data 7 
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problem is, but don't help 

to illuminate what the solutions would be,” 
says Sarah Schulman, founder of the organi- 
zation. Rather than major top-down changes, 
Schulman’s approach focuses on creating new 
interactions and networks within the existing 
urban system, promoting a shared under- 
standing among both policymakers and urban 
dwellers of how the system works, and what 
can be done to change things. 

But shared understanding is not a simple 
thing to achieve, especially between urban 
planners who tend to treat areas of the city 
separately. “We all talk different languages,” 
says Gatzweiler. Different disciplines have 
their own vocabularies, and it is entirely pos- 
sible for miscommunication to pass unnoticed. 
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For example, despite physicist Newell and 
historian Proust working and living together 
(the two researchers are married), they only 
realized that they were each defining a term 
often used in their work entirely differently 
after five years on a particular project. 

“Our capacity to act collectively is lagging,” 
says Gatzweiler. Overcoming the silos into 
which different aspects of the city network are 
divided — water, transport, health and so on 
— is acentral challenge of urban planning. In 
a system as complex as a city, no one person 
or speciality can see the whole picture; urban 
health is an interdisciplinary problem that 
requires collaboration across administrative 
and scientific barriers. Once again, Anderson’s 
theory of complexity makes the way ahead 
clear. Rather than have each discipline focus on 
making inroads into problems by themselves, 
Anderson wrote, “we should recognize that 
such roads, while often the quickest shortcut 
to another part of our own science, are not vis- 
ible from the viewpoint of one science alone.” 

Threats to urban health and well-being span 
disciplinary boundaries, and systems thinking 
offers a tool by which these boundaries can be 
bridged, says Gatzweiler “We need systems 
thinking and the co-production of knowledge 
in order to create healthier, wealthier cities, 
and people within them,” m 


Kevin Pollock is a freelance journalist based 
in Tel Aviv-Yafo, Israel. 
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Future flood losses in major coastal cities 


Stephane Hallegatte’?*, Colin Green*, Robert J. Nicholls* and Jan Corfee-Morlot® 


Flood exposure is increasing in coastal cities’? owing to 
growing populations and assets, the changing climate*, and 
subsidence*°. Here we provide a quantification of present and 
future flood losses in the 136 largest coastal cities. Using a 
new database of urban protection and different assumptions 
on adaptation, we account for existing and future flood 
defences. Average global flood losses in 2005 are estimated 
to be approximately US$6 billion per year, increasing to 
US$52 billion by 2050 with projected socio-economic change 
alone. With climate change and subsidence, present protection 
will need to be upgraded to avoid unacceptable losses of 
US$1 trillion or more per year. Even if adaptation investments 
maintain constant flood probability, subsidence and sea-level 
rise will increase global flood losses to US$60-63 billion per 
year in 2050. To maintain present flood risk, adaptation will 
need to reduce flood probabilities below present values. In 
this case, the magnitude of losses when floods do occur 
would increase, often by more than 50%, making it critical 
to also prepare for larger disasters than we experience 
today. The analysis identifies the cities that seem most 
vulnerable to these trends, that is, where the largest increase 
in losses can be expected. 

A first screening study’ provided a global overview of flood 
exposure in world coastal cities. The exposure metric can be 
viewed as a worst case scenario, but it does not estimate 
average annual losses, which is a standard metric in disaster 
risk management planning. To do so, it is necessary to take 
into account infrastructure-based adaptation (for example, dykes) 
and the vulnerability of populations and assets. Here, we assess 
economic average annual losses (AAL) in 136 coastal port cities, 
using a method developed for assessing city-level flood risk’ and a 
new database of urban coastal protection (Methods). 

Present aggregated average annual flood losses in the 136 cities 
are estimated at approximately US$6billion per year. Table 1 
ranks the most vulnerable cities in 2005 using two different 
metrics of vulnerability. In the left column, the table shows a 
ranking in terms of AAL, taking into account all potential floods 
and existing protection. The AAL estimates can be compared 
to more sophisticated approaches. For instance, the annual 
losses in New Orleans are estimated at US$600 million, close to 
the US$650 million estimates from the Interagency Performance 
Evaluation Taskforce’. In the right column, cities are ranked 
according to relative vulnerability, namely the ratio of AAL to the 
city’s gross domestic product (GDP). This value can be understood 
as the share of the city’s economic output that should be saved 
annually to pay for future flood losses. The 20 cities most vulnerable 
according to this last indicator are also presented in Fig. 1. 

The ranking in terms of exposure includes mainly rich-country 
cities (Supplementary Table S4). On average, however, rich cities 


are better protected than poorer ones, and the ranking in terms 
of absolute flood losses contains more cities from developing 
countries. In relative terms, developing-country cities are even 
more vulnerable, with only three cities from developed countries in 
the top 20 (New Orleans, Miami and Tampa—Saint-Petersburg). 
Moreover the ranking in absolute terms (left column) includes 
mainly capital cities, whereas secondary cities are more often 
represented in the ranking in relative terms (right column). This 
difference suggests that risk management efforts may be lower 
in secondary cities. 

Table 1 shows the importance of existing flood defences: in a 
city such as Amsterdam, exposure is extremely high (US$83 billion 
of assets exposed to the 100-year flood), but AAL do not exceed 
US$3 million, because estimated defence standards are the highest 
that exist globally. On the other hand, a city such as Ho Chi Minh, 
in Vietnam, has a 100 year exposure of only US$18 billion, but the 
lower level of protection means that the city is affected by small 
floods on a frequent basis, resulting in large estimated average 
costs. In relative terms, Ho Chi Minh City has one of the largest 
vulnerabilities, with AAL reaching 0.74% of local GDP. The ratio 
of AAL to local GDP exceeds 1% for two cities, Guangzhou and 
New Orleans. The vulnerability of New Orleans has been reduced 
however by recent post-Hurricane Katrina investments and is likely 
to be reduced further in the near future’. 

Another conclusion from Table 1 is the concentration of losses 
in only a few cities. Only 13 cities have average losses in excess 
of US$100 million, and three American cities (Miami, New York 
City and New Orleans) explain 31% of the global aggregate losses 
in the 136 cities, because of their high wealth and low protection 
level. Adding Guangzhou, the four top cities explain 43% of global 
losses. Also, the US seems particularly vulnerable, with 6 American 
cities in exposure ranking, 8 in the ranking by absolute AAL, and 
3 in the ranking by relative AAL. As coastal flood risks are highly 
concentrated, flood reduction actions in a few locations could 
be very cost-effective. 

To develop possible future patterns of drivers of risk to 2070, 
our analysis introduces three scenarios for socio-economic changes 
and six for environmental change. From there, we retain four main 
scenarios: SEC assumes only socio-economic changes, derived from 
OECD and UN scenarios; SEC-S adds subsidence to scenario SEC 
(40cm in 2050 in the cities subjected to subsidence); and SLR-1 
and SLR-2 add optimistic and pessimistic sea-level rise scenarios 
to SEC-S, respectively (with 20cm and 40cm in 2050). Here, we 
report results for 2050, but results for 2030 and 2070 are available 
in the Supplementary Information. 

With no adaptation, the projected increase in average losses 
by 2050 is huge, with aggregate losses increasing to more 
than US$l trillion per year in scenarios SLR-1 and SLR-2 
(Supplementary Table S6). All cities experience a similar increase 
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Table 1| City ranking by risk (AAL) and relative risk (AAL in percentage of GDP) for 2005. 


Ranking by AAL (US$ million) Ranking by relative AAL (percentage of city GDP) 
Urban 100 year AAL, with AAL, with Urban 100 year AAL, with AAL, with 
agglomeration exposure protection protection agglomeration exposure protection protection 
(US$ million) (percentage (US$ million) (percentage 
of GDP) of GDP) 

1 Guangzhou 38,508 687 1.32% 1 Guangzhou 38,508 687 1.32% 

Z Miami 366,421 672 0.30% 2 New Orleans 143,963 507 1.21% 

3} New York—Newark 236,530 628 0.08% 3 Guayaquil 3,687 98 0.95% 
4 New Orleans 143,963 507 1.21% 4 Ho Chi Minh City 18,708 104 0.74% 
5 Mumbai 23,188 284 0.47% 5 Abidjan 1,786 38 0.72% 
6 Nagoya 77,988 260 0.26% 6 Zhanjiang 2,780 46 0.50% 
7] Tampa—St. Petersburg 49,593 244 0.26% 7 Mumbai 23,188 284 0.47% 
8 Boston 55,445 237 0.13% 8 Khulna 2,073 13 0.43% 
9 Shenzen 11,338 169 0.38% 9 Palembang 1,161 27 0.39% 
10 Osaka—Kobe 149,935 120 0.03% 10  Shenzen 11,338 169 0.38% 
1 Vancouver 33,456 107 014% 11 Hai Phong 6,348 19 0.37% 
12 Tianjin 11,408 104 0.24% 2 N’ampo 507 6 0.31% 
33 Ho Chi Minh City 18,708 104 0.74% 13 Miami 366,421 672 0.30% 
14 Kolkata 14,769 99 0.21% 14 Kochi 855 14 0.29% 
15 Guayaquil 3,687 98 0.95% 15 Tampa—St. Petersburg 49,593 244 0.26% 
16 Philadelphia 22,132 89 0.04% 16 Nagoya 77,988 260 0.26% 
17 ‘Virginia Beach 61,507 89 0.15% 17 Surat 3,288 30 0.25% 
18 Fukuoka—Kitakyushu 39,096 82 0.09% 18 Tianjin 11,408 104 0.24% 
19 Baltimore 14,042 76 0.08% 19  Grande_Vitoria 6,738 32 0.23% 
20 Jakarta 4,256 73 014% 20 Xiamen 4,486 83) 0.22% 


A comparison with a ranking by exposure is proposed in the Supplementary Information. 


15: Tampa-St. 


Petersburg 17: Surat 


Ky! 


ag ae. rae Q 4 ( A 10: Shenzen 
a) 
je ; : 
~) by 


TENT caret | 


14: Kochi 
L/ gt Z 


3: Guayaquil « 
9: Palembang 
19: Grande Vitoria 


Ls 


4: Ho Chi Minh City 


@ AA-1% of local GDP O Aat>0.5% of local GDP 
© AAL>0.75% of local GDP © AAL0.25% of local GDP 


Figure 1| The 20 cities where the relative risk is larger in 2005, that is, where the ratio of AAL with respect to local GDP is the largest. More information 
in Table 1. 
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Table 2 | The 20 cities with the highest loss in 2050, assuming scenario SLR-1 and adaptation option that maintains flood 
probability (option PD). 


Scenarios with socio-economic change, subsidence, sea-level 
rise and adaptation to maintain flood probability (scenarios 
SLR-1, and adaptation option PD) 


Scenarios with socio-economic 
change alone (SEC) 


Urban agglomeration AAL (US$ million) AAL (per- AAL (US$ million) Increase in AAL AAL (percentage of city GDP) 
centage of compared with 
city GDP) 2005 (%) 
Guangzhou (S) 11,928 1.32% 13,200 1% 1.46% 
Mumbai 6,109 0.47% 6,414 5% 0.49% 
Kolkata (S) 2,704 0.21% 3/350) 24% 0.26% 
Guayaquil (S) 2,813 0.95% 3,189 13% 1.08% 
Shenzen 2,929 0.38% 3,136 7% 0.40% 
Miami 2,099 0.30% 2,549 21% 0.36% 
Tianjin (S) 1,810 0.24% 2,276 26% 0.30% 
New York—Newark 1,960 0.08% 2,056 5% 0.08% 
Ho Chi Minh City (S) 1,743 0.74% 1,953 12% 0.83% 
New Orleans (S) 1,583 1.21% 1,864 18% 1.42% 
Jakarta (S) 1,139 0.14% 1,750 54% 0.22% 
Abidjan 826 0.72% 1,023 24% 0.89% 
Chennai (Madras) 825 012% 939 14% 014% 
Surat 905 0.25% 928 3% 0.26% 
Zhanjiang (S) 806 0.50% 891 1% 0.55% 
Tampa—St. Petersburg 763 0.26% 859 13% 0.29% 
Boston 741 0.13% 793 7% 0.14% 
Bangkok (S) 596 0.07% 734 23% 0.09% 
Xiamen (S) 572 0.22% 729 27% 0.29% 
Nagoya (S) 564 0.26% 644 14% 0.30% 


‘S' indicates that the city is prone to significant subsidence. Most of these cities are located in deltaic regions, where subsidence influences local sea level in 2050. 


in risk. In the absence of adaptation, the impact of environmental 
change is much larger than the effect of socioeconomic change. 
These numbers should not be considered as predictions, but they 
demonstrate the need for adaptation, because inaction would result 
in unacceptably high losses. 

We then consider adaptation and how it will alter losses. We 
assume first that adaptation action increases coastal flood defences 
to maintain a constant probability of flooding (adaptation option: 
present design, PD). The increase in aggregate AAL is much lower in 
this case. Owing to socio-economic change, there is still a ninefold 
increase in aggregate losses, from US$6 to US$52 billion per year, 
but this is made more manageable by the fact that these cities 
are also much richer. However, rising water levels still increase 
AAL: subsidence by 12% and sea-level rise by an additional 2-8%, 
reaching between US$60 and 63 billion per year. 

Table 2 shows the top 20 cities in terms of AAL in 2050 in the 
scenario with subsidence and optimistic sea-level rise (SLR-1), with 
adaptation to maintain present flood probability. Guangzhou re- 
mains the most vulnerable city, with AAL exceeding US$13 billion. 
With socio-economic change alone, AAL in Guangzhou would be 
around US$12 billion per year in 2050 (a 17-fold increase in abso- 
lute terms). Subsidence and sea-level rise are thus responsible for an 
additional 10% increase, that is, a 10% increase in the AAL-to-GDP 
ratio. Indeed, even if the probability of coastal flooding is un- 
changed thanks to upgraded coastal defence infrastructure, the fact 
that a larger share of existing assets is protected by these defences 
means that annual losses will rise relative to local GDP. For instance, 
sea-level rise and subsidence increase the AAL-to-GDP ratio by 54% 
in Jakarta and by 24% in Abidjan even if present flood probabilities 
are maintained thanks to better defences. In other words, the world 
sees no more floods, but each flood is more destructive owing to 
sea-level rise and subsidence, even with better defences. This effect 


reinforces a trend that can be expected from socio-economic change 
alone, even in the absence of environmental change.!” 

Figure 2 shows the 20 cities where the increase in average 
annual losses between 2005 and 2050 is greatest in relative terms; 
detailed numerical values are provided in Supplementary Table S7. 
In Alexandria, for instance, maintaining flood probability leads 
to an increase by 154% in AAL. These most vulnerable cities 
are distributed all over the world, with a concentration in the 
Mediterranean Basin, the Gulf of Mexico and East Asia. Even 
though absolute levels of risk are sometimes low in these cities, they 
can be considered as adaptation hotspots because this is where flood 
risks are likely to increase the most in relative terms. 

To avoid any increase in risk, an adaptation policy needs to do 
more than maintain present flood probability. Rather, maintaining 
present levels of risk (relative to local GDP) in the context of 
rising sea levels, subsidence and socio-economic changes requires 
adaptation policy that reduces flood probability over time. In the 
adaptation option termed present losses, an upgrade in defence 
is thus calibrated to cancel the impact of environmental changes 
and to maintain present losses on average relative to local wealth, 
keeping aggregate losses at US$52 billion. 

For each city, we estimate the increase in defence standard that 
would maintain the relative risk level (that is, keep constant the 
ratio of average annual losses to local GDP). The required increase 
in protection is larger than local sea-level rise. For instance, in 
Alexandria, protection needs to be raised by 67cm, for a 60cm 
rise in local sea level; this corresponds to moving from a 100-year 
design standard to a 270-year design standard (that is, a division by 
2.7 of the probability of flooding). In other cities (Supplementary 
Table S7) the increase in dyke height is between 2 and 8cm 
larger than sea-level rise. This increase corresponds to a significant 
increase in the standard of protection, that is, to a large decrease 
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Figure 2 | The 20 cities where AAL increase most (in relative terms in 2050 compared with 2005) in the case of optimistic sea-level rise, if adaptation 
only maintains present defence standards or flood probability (PD). More information in Supplementary Table $7. 


in the probability of flooding (for example, a division by 1.6 in 
Jakarta and Shanghai). 

Even if the relative risk level is maintained, flood consequences 
would be much higher in a world with sea-level rise and better 
protection. In Alexandria in 2050, for instance, the probability 
of a flood may decrease by 2.7 owing to better defences, but 
if a flood occurs the total losses triple from US$17 billion (with 
socio-economic change alone) to US$51 billion, a tripling due to 
environmental changes alone. Results for other cities are presented 
in Supplementary Table S7. Many cities would experience losses 
that are more than 50% larger if an event exceeds the post- 
adaptation protection level. Hence, the world’s coastal cities become 
more dependent on flood defences, but also more vulnerable once 
failure or overtopping occurs. 

Finally, Table 3 looks at present city characteristics that influence 
the vulnerability in 2050. It shows that cities that grow rapidly, have 
large populations, are poor, exposed to tropical storms, and prone 
to subsidence are over-represented in the top 20 for absolute AAL 
(the cities from Table 2). However, these drivers are not as relevant 
for the relative increase in annual average loss with adaptation 
option PD (the cities from Fig. 2). Only subsidence seems to be a 
consistently good determinant of vulnerability for both absolute 
and relative measures of change, with twice as many cities with 
subsidence in the top 20 based on the two indicators (absolute 
and relative losses). The cities most vulnerable in relative terms 
may thus not be the ones suggested by the present situation and 
historic floods, nor are they the ones that necessarily attract the most 
research and analysis today with respect to managing risk. 

While recognizing the limitations and uncertainties in this 
analysis, three important policy conclusions can be drawn that 
are robust across a range of plausible assumptions. First, failing 


to adapt is not a viable option in coastal cities. It is difficult 
to estimate the cost of these adaptation options, as it depends 
on the specific context for each city and on selected approaches 
and technologies. On the basis of anecdotal evidence from 
a few cities'''*, a few billion US dollars per city in initial 
investment—plus approximately 2% of the initial investment cost 
in annual operation and maintenance costs—is the possible order 
of magnitude for adaptation costs'®. Hence, indicative annualized 
values with 5% interest rates are about US$350 million per year 
per city, or approximately US$50 billion per year for our 136-city 
sample. These estimated aggregate adaptation costs are far below 
our estimate of aggregate damage losses per year in the absence of 
adaptation, and of the same order of magnitude as residual losses 
with adaptation (Supplementary Table S6). These estimates include 
only flood risks and do not encompass all weather risks that these 
cities face; see, for instance, ref. 16 for a global analysis of wind 
damage from tropical cyclones. 

Second, managing coastal flood risk requires doing more than 
maintaining today’s standard of protection (and present probability 
of flooding). In practice, probability of flooding will need to be 
reduced to maintain flood risks at today’s levels. 

Last, improving standards of protection could maintain or 
reduce risk levels and decrease the number of floods, but the 
magnitude of losses when floods do occur will still increase. 
This result points to the limitations of what infrastructure-based 
adaptation can achieve. As illustrated by the recent landfall of 
hurricane Sandy on the east coast of the United States, there is a 
need to prepare at the local, national and international level for 
larger floods and the disasters that ensue. Such preparations can 
include strengthening disaster planning measures, including early 
warning and evacuation systems, more comprehensive insurance 
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Table 3 | The fraction of cities in the five categories (fast growing, large population, low income, subject to tropical storms and 
subject to subsidence) in the full sample of 136 cities, and in two top 20 categories using different indicators. 


Fast-growing city Large population Low income Subject to Prone to 
(local GDP growth in (larger than (GDP per capita hurricane and significant 
2005 larger than 5% 5 million in in 2005 less than tropical storms subsidence 
per year) 2005) US$5,000) 

All 136 cities 40% 19% 27% 51% 27% 

Top 20 in terms of AAL in 65% 55% 40% 95% 55% 

2050 (adaptation option 

PD; Table 2) 

Top 20 in terms of increase 40% 15% 20% 45% 40% 


in AAL (adaptation option 
PD; Fig. 2) 


schemes and other forms of post-disaster response to quickly 
rebuild affected communities. 


Methods 


Flood risks are analysed following ref. 7. The population exposure is taken from 
a previous analysis’. Exposed population was translated into exposed assets using 
an estimate of produced capital per inhabitant drawing on recent work from the 
World Bank'’. The DIVA database provides information about extreme water 
levels'®. We create a first database for coastal defences; this is based on collected 
evidence on existing defences where possible, and the authors’ expert estimates to 
complete the defence database (Supplementary Information). 

Then, we calculate the probability of different flood levels in each city (within 
the flood defences) using three simple models for defence failure or overtopping. 
Even though absolute risk levels depend on the failure model, the relative effect of 
sea-level rise and subsidence on losses is relatively robust to this uncertainty. 

From the probability of the various flood levels in the city, and from data on 
assets that are exposed at different flood levels, flood asset losses are estimated using 
depth—-damage functions for 6 categories of assets. 

To assess future losses, we use two socio-economic scenarios!?: an OECD-based 
growth scenario in which urban populations grow at the same rate in all cities, fol- 
lowing an extrapolation of UN urbanization scenarios; and the OECD-based growth 
scenario in which city population is capped at 35 million inhabitants. We assume 
that future assets in the city have the same elevation distribution as existing assets. 

The six scenarios of environmental change combine: two assumptions on 
subsidence (no subsidence, or a 40 cm subsidence in 2050 in all cities subject to 
it); and three assumptions about sea-level rise (none, 20 cm, or 40 cm in 2050). 
Combined with socio-economic scenarios and adaptation options, these lead to 
108 scenario combinations. Results for the 108 scenarios and input data and model 
codes are available in the Supplementary Information. The analysis presented 
here considers only four scenarios of the total number of scenarios (see text). All 
scenarios use the simplest defence failure model, the maximum protection level, 
and constrain cities to no more than 35 million inhabitants. 
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Two-stroke scooters are a dominant source 
of air pollution in many cities 
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R. Zimmermann*, N. Marchand®, S. Hellebust®, B. Temime-Roussel®, U. Baltensperger! & ASH. Prévot! 


Fossil fuel-powered vehicles emit significant particulate matter, for example, black carbon and 
primary organic aerosol, and produce secondary organic aerosol. Here we quantify secondary 
organic aerosol production from two-stroke scooters. Cars and trucks, particularly diesel 
vehicles, are thought to be the main vehicular pollution sources. This needs re-thinking, as we 
show that elevated particulate matter levels can be a consequence of ‘asymmetric pollution’ 
from two-stroke scooters, vehicles that constitute a small fraction of the fleet, but can 
dominate urban vehicular pollution through organic aerosol and aromatic emission factors up 
to thousands of times higher than from other vehicle classes. Further, we demonstrate that 
oxidation processes producing secondary organic aerosol from vehicle exhaust also form 
potentially toxic ‘reactive oxygen species’. 
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climate”. Road vehicles are a significant source of PM, 

particularly in urban areas. A number of recent studies have 
shown that a large fraction, possibly the largest, of vehicular PM 
is secondary organic aerosol (SOA) produced via atmospheric 
oxidation of precursor gases in the exhaust*°. Thus, 
understanding vehicular air pollution requires an assessment of 
SOA formation from different vehicle types. Two-stroke (2S) 
scooters (powered two-wheeled vehicles with engine 
displacement <50cm?) are popular globally, particularly in 
Asia, Africa and Southern Europe. Despite being high emitters of 
primary PM®’, regulations for scooters are generally less 
stringent than for other vehicles, for example, in Europe having 
reached Euro 5/V (a fifth tranche of regulations), for passenger 
cars and trucks, versus only Euro 2 for scooters (see 
Supplementary Table 1 and ref. 8). Accordingly, a scientific 
report to the European Commission suggests that scooters will 
emit more volatile organic compounds (VOCs) than all other 
vehicles combined in Europe by 2020 (ref. 9). Furthermore, high 
PM levels and toxic aromatic hydrocarbons, important SOA 
precursors!°, have been observed in many cities, especially in 
Asia!!. Globally, organic aerosol (OA) dominates PM, with SOA 
accounting for the largest fraction‘. 

Here we show that 2S scooters emit significant amounts of 
primary organic aerosol (POA), aromatic VOCs and also produce 
significant SOA. We use the term ‘asymmetric polluter’ to 
describe these vehicles as their emission factors (EFs) and 
evidence from air quality measurements before and after bans on 
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scooters in Asian cities suggest they may dominate vehicular 
pollution despite their relatively small numbers. Chemical 
analysis of the emissions shows that SOA is mainly produced 
via photo-oxidation of aromatic VOCs, present in gasoline, from 
the exhaust. This shows that the known issue of incomplete fuel 
combustion during the 2S cycle is also responsible for SOA 
formation. Finally, we present the first online measurements of 
aged exhaust showing that SOA formation also produces reactive 
oxygen species (ROS) with potentially detrimental effects on 
our lungs. 


Results 

Emission Factors. We investigated POA emissions and SOA 
formation from 2S scooters and their potential health effects. The 
oxidation of VOCs in 2S scooter emissions produces significant 
SOA (g carbon (C) kg~! fuel), with total OA on average 2.9 and 
2.4 times higher than POA after aging for idling and driving 2S 
scooters, respectively (Fig. 1, and Supplementary Table 2 ). In 
addition, substantial toxic aromatic emissions (up to ~40% of 
emitted VOC volume for the scooters of this study) of benzene, 
toluene and C2-C4 alkylated benzenes, which are recognized 
SOA precursors!°!?, are present in the exhaust. Among the 
aromatics, benzene is of particular concern due to its 
carcinogenicity. Levels in the raw 2S scooter exhaust were as 
high as 300,000 p.gm~? or 146 p.p.m.(v) from idling. The EU 
annual mean limit for the protection of human health is 
5ugm? (ref. 13), while the US National Institute for 
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Figure 1 | Emission factors from scooters and other vehicles. EFs plotted as box-and-whiskers (median line, red; 25th and 75th percentile, box; 10th and 
90th percentile, whiskers) of (a) POA, (b) aged OA (POA + SOA formation), (¢) benzene and (d) light aromatics (benzene, toluene and C2-C4 alkylated 
benzenes). Points shown next to the box-and-whiskers are the individual data points, coloured depending on measurement region for ambient data. 2S 
scooters (this study, n= 3) were run in idle or during driving cycles (ECE47). Data on the other vehicles shown are from the literature (Supplementary 
Table 3) for light-duty and heavy-duty vehicles (LDVs and HDVs). LDVs data are further divided between vehicles meeting Euro 5 and those not meeting 
Euro 5, labelled <Euro 5 in parenthesis. Ambient data are split according to a contribution of HDVs to the data of higher than 

or lower than 50%. Note that many of the higher ambient values are from older vehicle studies (Supplementary Table 3). 
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Occupational Safety and Health recommends that workers wear 
special breathing equipment when exposed to benzene at levels 
exceeding 1p.p.m. for 15min. Waiting in traffic behind a 2S 
scooter, for example, at junctions and while the scooter is idling, 
may therefore be highly deleterious to health. 


Secondary organic aerosol yields. The contribution of the 
aromatics to SOA formation was estimated by calculating an 
apparent aerosol yield, Yappareny. assuming all SOA comes from 
aromatic precursors: 


Yapparent = Csoa/ x Aj (1) 


where Csgoa is the SOA produced (ug m~ 3) for a given mass 
change in aromatic i (A;, i= benzene, toluene or C2-C4 alkylated 
benzenes). Apparent yields closely match average concentration- 
weighted literature aromatic SOA yields!® (Fig. 2a, 
Supplementary Note 1) for idling, complete ECE47 driving 
cycles and ECE47 phase one (Ph1), indicating that most SOA is 
from aromatic precursors (Fig. 2a). SOA from ECE47 phase two 
(Ph2) alone is underestimated by equation (1), suggesting SOA 
production from unidentified compounds, emitted by the hot 
engine. Note that the total emission during a full cycle is 
dominated by Phl, that is, by cold engine emissions. 
Furthermore, a “Van Krevelen diagram’ illustrates the aging of 
2S scooter emissions, from oxygen to carbon (O:C)~0 to 
O:C~ 0.6. This elemental composition is consistent with that of 
previously observed SOA from aromatic precursors! (Fig. 2b). 
We therefore conclude that SOA formation from 2S scooter 
emissions is likely from the oxidation of aromatics, in contrast to 
diesel SOA, which is predominantly from other precursors!?. 


Comparison to other vehicle types and ambient data. Figure 1 
also shows laboratory and ambient measurements of POA, light 
aromatic and benzene EFs from passenger cars and trucks 
(Supplementary Table 3). Ambient data are from roadside/tunnel 
measurements in the US, EU and Asia, and are split according to 
the fraction of light-duty and heavy-duty vehicles (LDVs and 
HDVs, respectively) at the measurement site. Note that the 
general trend is for lower EFs in newer studies, consistent with 
improvements in emission controls. Also shown are data from 
Indian in-use 2S autorickshaws for comparison to the European 
scooters of this study. Caution is required in such a comparison; 
however, although similar (both have 2S engines), these are a 
different vehicle class and were furthermore tested during a dif- 
ferent driving cycle. In general, ambient EFs from Asian vehicles 
are in the same range as European and US vehicles, while 
emissions from in-use 2S rickshaws are slightly higher than from 
the European scooters of this study. POA emissions from 2S 
scooters are on average around 20 (maximum 2,780) times higher 
than ambient (light-duty dominant) values, and aged OA emis- 
sions on an average 53-771 times higher than laboratory studies 
on other vehicle types. It should be noted that absolute aerosol 
concentrations can influence EFs: higher measurement con- 
centrations would lead to higher EFs!°. SOA formation is most 
significant from idling scooter emissions, while smaller at higher 
engine loads. However, POA emissions are higher under the latter 
conditions, and the aggregate POA + SOA emission at high load 
is comparable with that from idling. 


Reactive oxygen species. We also examined the health implica- 
tions of the 2S scooter SOA (other than those from the mass 
increase) using online measurements of particle-bound, water 
soluble reactive oxygen species (ROS)!°, which are linked to 
negative health effects!?, ROS are undetectable in POA, but 
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Figure 2 | Contribution of aromatic oxidation to two-stroke scooter 
secondary aerosol formation. (a) Apparent SOA mass yields, Yapparent 
(equation (1)), as a function of suspended OA concentration (Coq). Error 
bars show the sensitivity of Yapparent to the chamber wall-loss factor, + one 
S.d. Yapparent for a Euro 1 and two Euro 2 2S scooters are shown in red, blue 
and orange, respectively. Ph 1 and Ph 2 are the first and second phases of 
the ECE47 driving cycle, | and L refer to idling and simulated low power, 
respectively. A predicted yield, concentration-weighted, for the mixture of 
all aromatics (Supplementary Note 1), is given in green triangles. The 
shaded region denotes the range between maximum (low NOx) and 
minimum (high NOx) SOA yields for m-xylene, a major aromatic 
constituent of gasoline. (b) Elemental ratios of OA emissions for the Euro 1 
(squares) and a Euro 2 (triangles) scooter as a function of photochemical 
age. Elemental ratios observed for xylene’? and ambient?! SOA are shown, 
orange and purple, respectively. 


accounts for 0.5-1% carbon in the aged OA, suggesting that PM 
emissions initially become increasingly toxic with aging (Fig. 3). 
Increasing ROS is consistent with the increased O:C ratio of the 
aerosol and in line with a previous study showing increased 
oxidative potential with aging for 2S scooter emissions, albeit at 
aerosol and oxidant loadings much higher than under ambient 
conditions!®. After 1-2h of irradiation, ROS stabilizes or 
decreases, as reported previo for organic peroxides, likely 
due to decomposition processes!?°. 


Discussion 

There are likely several reasons for these relatively large OA and 
aromatic emissions from 2S scooters. First, 2S engines, unlike 
four-stroke (4S), require addition of lubricant oils to the fuel, 
some of which is emitted in the exhaust. Second, during the 2S 
engine cycle some of the fresh fuel/air mixture passes directly 
through the engine”!, increasing VOC emissions, which may 
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explain the high SOA formation. Third, scooters generally utilize 
‘rich combustion’ (low-air/fuel ratio), improving drivability while 
producing higher CO, VOC and PM emissions (but lower NOx). 
Accordingly, the VOC emissions measured here, in particular 
aromatics as found in raw gasoline, are also on average 124 and 
11 times higher from idling and driving 2S scooters, respectively, 
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Figure 3 | Reactive oxygen species in two-stroke scooter emissions. The 
percentage of water soluble reactive oxygen species (ROS) and elemental 
O:C ratios of organic aerosol as a function of time after lights on in the 
smog chamber from Euro 1 (red) and Euro 2 (blue) 2S scooter exhaust 
emissions. ROS concentration measured in moles hydrogen peroxide 
equivalents is normalized to the molar organic carbon concentration per m 
inside the smog chamber to give a percentage. 
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compared with those from other vehicles. Finally, scooter after- 
treatment systems are inherently inefficient due to their relatively 
small size and longer light-off times. 

Precise estimation of a relative contribution to vehicular PM 
and aromatics from 2S scooters is difficult, since vehicle 
regulations vary by country. Another complication arises from 
the possibility of large contributions to OA from a small number 
of super-polluting vehicles (of all types). However, many scooters 
will likely fall into this super-polluting category, especially as a 
considerable number of scooters are in operation in some regions 
without any form of emissions control (note that all scooters 
presented in this study are equipped with two-way oxidation 
catalysts, which reduce emissions of carbon monoxide and 
VOCs) and because emissions may be further exacerbated by 
poor maintenance and tampering, rife for scooters”. 
Furthermore, ambient data in Fig. 1 likely include a number of 
such super-polluting vehicles. Therefore, our results suggest that 
2S scooters are “asymmetric polluters’ of OA and aromatics 
compared with other vehicles. Using the average 2S scooter EF 
(ECE47 driving cycle) in Fig. 1 suggests that 2S scooters 
contribute to around 60% of roadside POA in Bangkok, where 
they account for 10% of fuel consumption (Fig. 4). In a more 
extreme case (comparing the 75th percentile for scooters and 25th 
percentile for ambient light-duty dominated), 2S scooters would 
contribute over 96% to roadside POA. Note that these values are 
based on the European scooters of this study. As Fig. 1 shows, 
emissions from some in-use Asian 2S vehicles may be higher, by a 
factor of three. Since other Asian vehicles are not expected to be 
more polluting based on Fig. 1, higher emissions from in-use 
Asian 2S vehicles would strengthen our conclusion that 2S 
scooters dominate urban pollution in the region. 

Estimation of contributions to aged OA is more difficult as 
vehicular SOA has not been systematically quantified under 
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Figure 4 | Ambient and model data on two-stroke scooters. (a) Share of total fuel consumption by 2S scooters in 2005, 2010 and 2015 from the 


Greenhouse gas Air pollution Interactions and Synergies, GAINS, model. (b) 


Roadside benzene, toluene, ethyl-benzene and xylene (BTEX) before (green) 


and after (blue) banning/restricting 2S scooters in two Chinese cities (¢) Roadside BTEX and number of all vehicles in three Chinese cities, before (green) 


and after (blue) banning/restricting 2S scooters. 
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ambient conditions. However, smog chamber measurements 
suggest average-aged OA contributions to ambient vehicular PM 
of 85% (comparing with LDVs meeting Euro 5) or 98% 
(comparing with LDVs meeting less than Euro 5) from 2S 
scooters. Meanwhile, in the EU, 2S scooters consume only 1% of 
vehicle fuel, Fig. 4. Even with these low numbers, scooters may be 
the major source of some of the vehicle-related pollutants, 
especially in Southern Europe, and our data suggest that reducing 
the numbers of these vehicles would cost-effectively mitigate 
vehicle OA and aromatic emissions, given the alternatives 
available (electric and 4S). In this regard China has taken the 
lead, banning or restricting scooters in many cities since the late 
1990s*3, leading to large decreases in the traffic-related aromatic 
emissions in some Chinese cities (Fig. 4b). Strikingly, roadside 
aromatics are now higher in Dongguan, where scooters are not 
banned, than 60km away in Guangzhou, even though the traffic 
volume is much higher in Guangzhou (Fig. 4c). This result is 
statistically significant: year-to-year benzene, toluene, ethyl- 
benzene and xylene concentrations in Guangzhou were 
229 ugm~ > in 1996, 244ugm~ > in 1999, 290 1gm~? in 2000 
and 150 gm? in 2002, average 228+68 versus 37 gm > 
after the scooter ban in 2005, for example. Benzene, toluene, 
ethyl-benzene and xylene concentrations for Guangzhou and 
other cities reported in the literature are given in Supplementary 
Table 4. 

Our data suggest that 2S scooters are a significant, and in 
many cities the largest, source of vehicular PM and toxic SOA 
and aromatic hydrocarbons, despite being a relatively small 
fraction of the total fleet. Therefore, given the alternative 
technologies available, restrictions on 2S scooters, already 
implemented in China, could improve air quality in many cities 
around the globe. 


Methods 

Measurement campaigns. We combine results from two measurement cam- 

paigns where 2S scooter exhaust was injected through a heated inlet into smog 
chambers***”> to produce SOA via photochemistry. During the first study, an 
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in-use Euro 1 (El) and a new Euro 2 (E2a) 2S scooter were run in idle or simulated 
low power. During the second campaign, emissions from a different Euro 2 2S 
scooter (E2b) were sampled during ECE47 driving cycles”®, Supplementary Table 5 
provides specifications of these vehicles. European (exhaust) emission standards 
are shown in Supplementary Table 1. Average OH concentrations were 

~5 x 10®cm~3. OH concentrations were determined from the decay of a nine 
times deuterated butanol (butanol-D9, 98% Aldrich) tracer as measured using a 
quadrupole proton transfer reaction mass spectrometer (idling 2S scooters) or 
proton transfer reaction time-of-flight mass spectrometer (Ionicon Analytik, 
driving cycle 2S scooters), see also ref. 27. 


Estimation of the NOx regime during experiments. Experiments were under 
‘high NO,’ conditions, which we define as the chemical regime where the main 
reactions of peroxy radicals (RO2) are with NO rather than other peroxy radicals 
(self-reaction, or reaction with hydroperoxy radicals). An estimate of the ratio of 
the RO-forming reactions (RO2+ NO) versus peroxide-forming reactions(RO2+ 
RO), RO, + HO,) is possible for experiments conducted on idling scooters at the 
Paul Scherrer Institute chamber, instrumentation for which includes a NOx 
monitor equipped with a ‘blue light converter’ (ensuring NOx is truly measured as 
NO, + NO). Figure 5b shows the measured concentrations of NO and Os, from an 
experiment conducted on 22 November 2010. This experiment was typical, with an 
initial VOC:NOx ratio of around 50 and continuous addition of NO during 
photochemical aging. 

The concentration of NO as a function of time t is given by: 


d{NO] 
dt 

Where jxoz is the photolysis rate of NO, in the smog chamber (0.01 s~ 1) and 
k, (1.8 x 10~!4cm3 molecule ~!s~!) and ky (7.7 x 10~ !4cm? molecule ~!s~ ) 
are the reaction rate constants for NO with O3 and peroxides (CH3O;) at 298K, 
respectively. Assuming a steady state of NO (only an approximation, Fig. 5 
indicates this point is not reached until around 15h of OH exposure (at OH = 10° 
molecules cm ~ 3)), equation (2) can be written: 


__ jnoz[NO2] — k; [NO][Os] 
ky [NO] 


= jno2 [NO}] —k [NO] [Os] —k, [NO] [RO2 + HO,], (2) 


[RO, + HO}] ; (3) 


Equation (3) suggests NO concentrations at least an order of magnitude 
higher than RO, + HO, (for example, 14 times higher at OH = 10 x 10° 
molecules cm~*h) during the experiment, based on concentrations measured 
inside the chamber (Fig. 5). 

The branching ratio r between the RO2/HO, reactions with NO versus other 
with peroxides (Fig. 5a) is determined using 


k2[NO] 
ky[HO, + RO3] ’ 
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Figure 5 | Estimation of NO, regime. (a) Calculated branching ratio between nitrate and peroxide reactions (orange squares) in the smog chamber 
during aging of emissions from an idling 2S scooter and (b) measured concentrations of ozone (O3) (purple circles) and nitrogen monoxide (NO) (grey 
circles) as well as calculated peroxy radical (HO2+ ROz) (blue circles) concentrations. 
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where k; is the reaction rate constant between HO, and CH3Q) (7.7 x 10~ !?2cm3 


molecule~!5~!) at 298K. We assume that the concentration of HO, and RO; is 
the same. 

Figure 5a illustrates that the NO pathway is dominant, by at least a factor of 
20 until OH = 10 x 10° molecules cm ~ *h, and by initially thousands of times higher, 
over the peroxide pathway. Since r >> 1 we consider these experiments ‘high NOy’. 

An estimation of r during the tests on driving cycle emissions is complicated by 
the lack of an accurate NOx instrument (that is, one equipped with a blue light 
converter). Furthermore, NO was only continuously added during the experiment 
where emissions were sampled from the cold phase (which featured the highest 
VOC:NOx ratio of any experiment). However, given that the driving cycle tests 
generally produced higher NO emissions than the idling tests, and given that most 
fall on the yield curve in Fig. 2, we also assume r>> 1, although we can not rule out 
that during some experiments the conditions change from high to low NOx. 
Although the VOC:NOx ratios were high (around 50), our best estimate suggests 
that idling experiments were high NOx throughout. Figure 5b shows the measured 
concentrations of NO and O3, during the experiment. 


Idling scooter experiments. Emissions were introduced into the 27 m? Paul 
Scherrer Institute Teflon environmental chamber‘. The external temperature of 
the scooter exhaust was monitored (Thermocouple type K, Messelemente) and 
after an initial warming period of several minutes (consisting of idling or applying 
low power) the emissions were injected only when the external exhaust 
temperature was stable at idle or at simulated low power. Supplementary Table 2 
provides the operating conditions, smog chamber OA concentrations and aerosol 
EFs of this study used in Fig. 1. 

OA was monitored with a high-resolution time-of-flight aerosol mass 
spectrometer (Aerodyne). Unity collection efficiency is assumed, since emitted 
particles likely consist of spherical oil-like droplets with low bounce. After an initial 
spike in the OA concentration following sample injection, a time of at least 20 min 
was allowed for equilibration. The concentration of OA after this point was taken 
as the initial POA emission. A battery of 80 x 100 W ultraviolet black lights 
(ErgoLine ‘Cleo Performance’, Solarium lights), was used to initiate photo- 
oxidation and SOA formation. Experiments were carried out with a steady 
injection of NO (<20mlmin~!) whereby NO was maintained at around 2-3 
ppb(v). Relative humidity inside the smog chamber was between 40-60% for all 
experiments, and temperature was maintained at 25 °C. 

OA was corrected for wall losses using 


OAwic(t) = ee , (5) 


where OAwic(t) and OAyeas(t) are the wall-loss corrected and measured organic 
matter concentrations, respectively, as a function of time t, and k is the first order 
mass-loss rate constant determined from an exponential fit of BC data. 

VOCs inside the smog chamber were quantified with a quadrupole proton 
transfer reaction mass spectrometer, while carbon monoxide was quantified with a 
dedicated CO monitor (Aerolaser, CO-Monitor AL5002) and total gas-phase 
hydrocarbons were measured from the chamber using a flame ionization detector 
(FID, J.U.M model VE 7). Additional measurements at the tailpipe were performed 
by transferring emissions through a heated line (191°C) to a Fourier transformed 
infrared spectrometer (MKS Multigas analyser 2030) for online measurements (at 
1 Hz) of small hydrocarbons, nitrogen containing species (NO, NO, N.O, NH3 
and HCN) and other oxygenated small organics (formaldehyde, acetaldehyde), as 
well as CO and CO;. 


Online reactive oxygen species measurements. Online particle-bound ROS 
analysis utilized the fluorescence probe 2,7-dichlorofluorescein in solution. Parti- 
cles were collected and continuously extracted on a wetted hydrophilic filter. The 
particle collector samples air at 51min ~! and collects particles larger than aero- 
dynamic diameter 50 nm with greater than 95% efficiency. Particles are collected 
and extracted in an aqueous solution of horseradish peroxidase (0.5 U ml~ !) 
allowing immediate reaction of ROS on collection. The concentration of ROS is 
characterized following subsequent reaction of the oxidized horseradish peroxidase 
with 2,7-dichlorofluorescein (5 1M) for 10 min at 40°C, yielding the fluorescent 
product DCF in the continuous flow set up. The concentration of 2,7-dichloro- 
fluorescein is measured using fluorescence spectroscopy in a flow-through cell and 
calibrated to ROS concentration with hydrogen peroxide. ROS data in Fig. 3 are 
normalized to the total carbon m~ 3, determined from high-resolution fitting of 
aerosol mass spectrometer data, and presented as a percentage. 


Driving cycle scooter experiments. The Paul Scherrer Institute mobile smog 
chamber? was deployed, and experiments conducted in a certified chassis 
dynamometer test cell (Vehicle Emissions Laboratories, Joint Research Centre of 
the European Commission, JRC-Ispra, Italy)**°. Emissions from 2S scooters were 
sampled at the tailpipe during full ECE47 driving cycles, during Ph1 only of the 
ECE47 (first four modules of the driving cycle, Ph1), and during Ph2 only of the 
ECE47 (final four modules of the driving cycle, Ph2). The emissions were 
transferred to the smog chamber via a heated inlet system (150°C) and Dekati 
ejector dilutor. Ultraviolet lights were switched on after several minutes to initiate 


photochemistry. OA concentrations were measured with a high-resolution time-of- 
flight aerosol mass spectrometer (Aerodyne), while black carbon was quantified 
with an aethalometer (AE33, Aerosol d.o.0.). The exponential decay rate of black 
carbon k was used in equation (5) to correct for particle losses to the walls. Gas- 
phase compounds were monitored with a proton transfer reaction time-of-flight 
mass spectrometer (Ionicon), while CO, and CO were measured using a cavity ring 
down spectrometer (Picarro, G2401) and total hydrocarbons were measured with a 
flame ionization detector (Horiba, THC Monitor APHA-370). 


Emission factor determination. EFs from both measurement campaigns (EF, 
gCkg~! fuel), (see also Supplementary Table 3), were calculated using a carbon 


mass balance: 
EF = ae) We, (6) 
Cco2 + Cco + Cuc 


where C denotes carbon mass, and the subscripts CO2, CO, HC, carbon dioxide, 
carbon monoxide and hydrocarbon, respectively. W. is the fuel carbon content 
(0.847 for gasoline). 

For the idling scooter experiments, Coo and Cco2 were measured at the tailpipe 
using the Fourier transformed infrared spectrometer. Cjy¢ was measured from the 
smog chamber and scaled-up to the tailpipe concentration using the dilution ratio 
CO aitpipe/COsmog chamber- Meanwhile, for the driving cycles all concentrations were 
determined at the smog chamber. 


Emission factors from the literature. Figure 1 and Supplementary Table 3 show 
EFs calculated from the literature. When available, EFs are given as reported. OA 
EFs measured in tunnels/roadside are assumed to consist purely of POA and are 
converted to EFs in units of gkg~ ! fuel using an organic matter to organic carbon 
ratio (OM:OC) of 1.2 (ref. 30). EFs given in units of gkm ~1! are converted using 
the following fuel consumptions (kmkg~!): Asia LDVs: 16.43; US LDVs: 14.93; 
EU LDVs: 18.20; Heavy-duty vehicles: 2.85. EFs measured during the Kansas City 
vehicle study are estimated by inserting per km EFs into equation (6). SOA in 
Supplementary Table 3 (gC kg ~! fuel) is converted from units of gkg~ ! fuel using 
an OM:OC ratio of 2.0 (ref. 30). SOA formation from 2S scooters is not available in 
the literature, but was estimated from emissions of aromatic hydrocarbons using a 
yield (see Supplementary Note 1 and Fig. 2) of 8.4% (suspended OA concentration 
50 gm ~ 3). Further notes to individual studies are also provided in 
Supplementary Table 3. 
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Urban characteristics attributable to 
density-driven tie formation 


Wei Pan!, Gourab Ghoshal!, Coco Krumme!, Manuel Cebrian!?? & Alex Pentland! 


Motivated by empirical evidence on the interplay between geography, population density and 
societal interaction, we propose a generative process for the evolution of social structure in 
cities. Our analytical and simulation results predict both super-linear scaling of social-tie 
density and information contagion as a function of the population. Here we demonstrate that 
our model provides a robust and accurate fit for the dependency of city characteristics with 
city-size, ranging from individual-level dyadic interactions (number of acquaintances, volume 
of communication) to population level variables (contagious disease rates, patenting activity, 
economic productivity and crime) without the need to appeal to heterogeneity, modularity, 
specialization or hierarchy. 
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larger percentage of people live in cities than at any point 

in human history!, while the density of urban areas is 

generally increasing”. One of the enduring paradoxes of 
urban economics concerns why people continue to move to cities, 
despite elevated levels of crime, pollution, disease and wage 
premiums that have steadily lost ground to premiums on rent?. 
New York in the 18th century, according to Thomas Jefferson, was 
‘a toilet of all the depravities of human nature’. Since Jefferson’s 
day, the city has grown to host the depravities of 100-fold more 
people, yet the stream of new arrivals has not stemmed. 

While the forces behind any urban migration are complex, the 
advantages afforded by urban density comprise an important 
driver. Smith* was one of the first to point to urban centres as 
exceptional aggregators, whether of innovations or depravities. 
Cities appear to support levels of enterprise impossible in the 
countryside, and urban areas use resources more efficiently, 
producing more patents and inventions with fewer roads and 
services per capita than rural areas>~!°. 

Despite the widespread focus on density as a driver for the 
uniqueness of cities both in scientific and popular audiences, we 
still lack a compelling generative model for why an agglomeration 
of people might confer an advantage. Important advances in 
several fronts have highlighted the difficulty in gaining an 
understanding of the urban processes beyond the density 
description level. Early economic models of agglomeration point 
to the role of technology diffusion in creating intellectual 
capital'!~!3, but lack a quantitative description of the generative 
mechanism for how this diffusion happens. Hierarchies have 
also been proposed as an elegant mechanism for this growth’; 
however, recent studies hint at the absence of well-defined 
hierarchy across geographical scales!*-!°. It has also been 
observed”” that diversity among residents and their interming- 
ling displays a weak correlation with cities’ success thus 
prompting the authors to conclude ‘more fine-scale data on 
interactions among people of different disciplines—or the culture, 
laws and peculiarities of cities—is required to better assess the 
under- or overperformance of innovation of cities’. 

Recent developments in the study of social networks shed some 
light on this challenge. Empirical evidence suggests that 
interactions and information exchange on social networks 
are often the driving force for idea-creation, productivity and 
individual prosperity. Examples of this include the theory of weak 
ties*)?, structural holes”, the strong effect of social interaction 
on economic and social success”4, the influence of face-to-face 
interactions on the effect of productivity, as well as the 
importance of information flow in the management of Research 
and Development”>°. Consequently, it seems that understanding 
the mechanism of tie formation in cities is the key to the 
development of a general theory for a city’s growth described by 
it’s economic indicators and its population. Following this line of 
thinking, our proposed answer for super-linear growth of cities 
can be regarded as a natural extension of Krugman’s insights 
on industries®. Krugman pointed out the connection between 
manufacturing efficiency and transportation of goods as a 
function of proximity of factories. Similarly, our theory 
connects the efficiency of idea-creation and information flow to 
the proximity of individuals generating them. 

These ideas have a long lineage in urban sociology, urban 
geography and economic geography. Louis Wirth, for example, 
conceptualized these processes in the late 1930s (ref. 27), 
prompting a vast literature in economic geography that explores 
the relationship between density and innovation, as well as that 
between diffusion (via social ties) and population density (going 
back to Hagerstrand in the 1950s (refs 28,29); and a more recent, 
but well-established literature on density and creativity (see, for 
example, Richard Florida’s work*°->?), 


In this paper, we present a simple, bottom-up, robust model 
describing the efficient creation of ideas and increased productiv- 
ity in cities. This contribution’s goal is to integrate these ideas into 
a single mathematical model that can be tested against available 
empirical data. Our model consists of two essential features. We 
propose a simple analytical model for the number of social ties 
T(p) formed between individuals, with population density p as its 
single parameter. We demonstrate that increases in density and 
proximity of populations in cities leads to a super-linear growth of 
social-tie density for urban population. We then show that the 
diffusion rate along these ties—a proxy for the amount and speed 
of information flow and idea adoption—accurately reproduces the 
empirically measured scaling of urban features such as rate of 
AIDS/HIV (acquired immunodeficiency syndrome/human 
immunodeficiency virus) infections, communication and GDP 
(gross domestic product). The model naturally leads to a super- 
linear scaling of indicators with city population? without the need 
to resort to any parameter tuning (although it predicts a different 
functional form than a simple power-law and is a more accurate 
match to the data). The surprisingly similar scaling exponent 
across many different urban indicators (see Supplementary Note 1 
and Supplementary Table S1), suggests a common mechanism 
behind them. Social-tie density and information flow, therefore, 
offer a parsimonious, generative link between human communi- 
cation patterns, human mobility patterns and the characteristics of 
urban economies, without the need to appeal to hierarchy, 
specialization or similar social constructs. 


Results 

A model for social-tie density. We propose to model the for- 
mation of ties between individuals (represented as nodes) at the 
resolution of urban centres. As our model is based on geography, 
a natural setting for it is a two-dimensional Euclidean space with 
nodes denoted by the coordinates x; € R? on the infinite plane. 
Furthermore, we also assume that these nodes are distributed 
uniformly in space, according to a density p defined as, 


p=no. of nodes per unit area. 


While the assumption of uniform density is an approximation, 
the qualitative features of the model are unaffected by other more 
realistic choices of the density distribution—see Supplementary 
Note 2 and Supplementary Fig. S1. Following Liben-Nowell 


et al.*>, we define the probability of a tie to form between two 
nodes i, j in the plane as 
1 
Pi x ——., 1 
aa rank; (j) ”) 


where the rank is defined as 
rank;(j): =|{k: d(i,k) < d(i,j)}, (2) 


and dij is the Euclidean distance between the two nodes. If j lies at 
a radial distance r from node i, then the number of neighbours 
closer to i than j is the product of the density and the area of the 
circle of radius r, and thus the rank is simply, 


rank;(j) = prr’, (3) 
which implies that the probability an individual forms a tie at 
distance r goes as P(r) ~ 1/zr’, similar in spirit to a gravity 
model##. 

For a randomly chosen node, integrating over r up to an urban 


mobility ‘boundary’ denoted as rmax, we obtain the expected 
number of social ties for this chosen node denoted as t(p). 


t(p) = Inp+C, (4) 


where C=2Infmax + Ina+1. We note that rmax may well be 
unique for each city, and is often determined by geographical 
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constraints, as well as city infrastructure (cf. Supplementary Note 
3 and Supplementary Fig. S2-S3). Integrating over the number of 
social ties for all nodes within a unit area gives us the social-tie 
density T(p), 


T(p)=plnp+Cp, (5) 


with C’=C-—1. Thus the density of social ties formed between 
individuals grows as T(p) ~ plnp, a_ super-linear scaling 
consistent with the observations made by Calabrese et al. 

(also discussed in the content below). We argue that T(p) to a first 
approximation is the individual dyadic-level ingredient behind 
the empirically observed growth of city indicators. For more 
detail on the theoretical analysis and support for the assumptions 
involved, see Supplementary Note 4 and 5 and Supplementary 
Figs S4—S6. 

In order to test this theoretical result, we perform simulations 
of tie formation with more realistic discrete settings. Urban areas 
differ dramatically in both regional boundaries and population 
density. It is thus important to test the sensitivity of the model to 
a diversity of input parameters for the density p and the urban 
‘boundary’ rmax- We start from an empty lattice of size N x N, 
with N? possible locations. The density p is gradually increased by 
randomly assigning new nodes to empty locations on the grid, 


where each node represents a small community, or city block of 
10* individuals. Once a node is added, the probability of forming 
a tie with one of its existing neighbours is computed by counting 
the number of nodes closer to this node according to 
equation (1). To test the sensitivity of our results to the relevant 
parameters we vary the size of the grid (20 < N< 400) of blocks to 
mimic different scales for city boundaries ryax assuming Tmax is 
the size of the grid. In addition, we also vary city population 
between 104 and 107 residents, as well as the functional form of 
the density distribution. 

In Fig. 1 we show the average over 30 realizations of the 
simulation for different values of the grid size N and 
city boundary fmax. The density p in this case represents 
the relative percentage of occupied locations on the grid, and 
T(p) the total number of ties formed between nodes. As Fig. 1 
shows, the agreement between the theoretical expression for T(p) 
equation (5) and the curves generated by the simulation, is 
excellent at all scales despite our continuum approximation 
(R* 1). 

As a comparative exercise, on the same plot, we also show 
the best fit to the form T(p) ~ p? and find a value of 61.16. 
We note that this value is strikingly similar to empirically 
observed values by fitting a power-law to the relationship 
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Figure 1 | The number of social ties as function of grid sizes and urban mobility limits. The number of ties T() plotted as a function of p for various 
grid sizes N. The data points represent the average over n= 30 realizations of the simulation described in the text, while the solid green line is the 
theoretical expression equation (5). The dashed line is a fit to the form T(p) ~ p®. As can be seen in each case the agreement between theory and 
simulation is excellent. The best fit to the scaling exponent yields a value of 61.15 independent of N. Note that the measured value of the exponent 


in empirical data is 1.1<f$<1.3. 
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between population and urban indicators. It has been suggested 
that a fit of the form xInx can easily be mistaken for x? (ref. 36), 
which together with our model suggests that the observed scaling 
of cities may alternatively be described by equation (5). The latter 
functional form is additionally supported by the fact that it 
represents a generative model for the emergence of urban features 
as a result of density-driven communication patterns, without any 
parameter tuning or a priori assumption about the structure of 
the underlying social network. Our simulation results indicate 
that the scaling described in equation (5) is robust with respect to 
the choice of different functional forms for the density 
distribution. (Supplementary Note 2 and Supplementary Fig. 1). 


100,000 + 


1,000} © %*" «Data 
: -— Power-law f=1.14, R?=0.81 (Calabrese et al. 35) 
—— Our model, R?=0.99 


1,000 10,000 100,000 10,000,000 
County population 


Overall call time within county 


Figure 2 | Overall time of calls between residents of a county as a 
function of its population. The points refer to the data (adapted from 
Calabrese et al.2° computed from ten million users’ mobile phone call 
records within United States during July 2010), while the solid line is the 
theoretical prediction from the model equation (5) adapted to raw 
population. The model captures both the super-linear growth and tilts on 
both ends of the curve while providing a superior fit to the data (based on 
adjusted R-value) when compared with a pure power-law relation (dashed 
curve). 
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Empirical evidence for the effect of social-tie density. Recent 
work*> shows a super-linear relationship between calling volume 
(time) and population across different counties in the United 
States. As Fig. 2 illustrates, the super-linear relationship in 
the data is approximated by the authors as a power-law 
growth y= ax! with B~1.14. However, by assuming a uniform 
distribution on county sizes and treating population as a proxy 
for density, we show that our density-driven model is able to 
capture precisely the distribution of the call volume. The model 
produces the exact shape of the curve, including the power-law 
growth pattern (f = 1.14) and tilts on both end, with an adjusted 
R? =0.99 (see Fig. 2). Consequently, we propose that the model 
may well provide a reasonable explanation for communication 
patterns observed in US counties. 


Information diffusion and adoption with social-tie density. We 
note that the expected patterns of link and interaction formation 
in itself is insufficient to explain how growth processes in cities 
work to create observed certain scaling phenomena such as 
productivity and innovations. Instead, we believe that the manner 
in which these links spread information and encourage idea and 
behaviour adoptions actually determines value-creation and 
productivity. As it is known that social network structure has a 
dramatic effect on the access of information and ideas*)?423,2.26, 
it seems plausible that higher social-tie density should engender 
greater levels of idea spreading leading to the observed increases 
in productivity and innovation. 

To test the hypothesis that a city’s productivity is related to 
how far information travels and how fast its citizens gain access to 
innovations or information, it is natural to examine how this 
information flow scales with population density, and to quantify 
the functional relationship between link topology and speed of 
information spreading. We, therefore, simulated two models of 
contagion of information diffusion*”~°’ on networks generated 
by our model. The first contagion model simulates diffusion 
of simple facts, where a single exposure is enough to guarantee 
transmission. The second more complex diffusion model is 
typical of behaviour adoption, where multiple exposures to a new 
influence/idea is required before an individual adopts it. In Fig. 3, 
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Figure 3 | The spreading rate as a function of density for two different contagion models. (a) The mean spreading rate as a function of density p. 
The points correspond to n= 30 realizations of simulations of the Sl model on a 200 x 200 grid. The dashed line corresponds to a fit of the form 
R(p) ~ p'** with «=0.18. The solid line is a fit to the social-tie density model. (b) The mean spreading rate as a function of p under the complex 
contagion diffusion model based on n= 30 realizations of simulations. The dashed line corresponds to the power-law fit of the form R(p) ~ p't% 

with «= 0.17. Once again the solid line is the fit to the model described in the paper. In both cases, the social-tie density model provides a better fit than a 
simple power-law with with much lower mean-square errors (29% and 41% lower respectively). 
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we discover that in both susceptible-infectious (SI) and complex 
contagion models the mean diffusion speed grows in a super- 
linear fashion with f+ 1.2, in line with our previous results and 
match well with the disease spreading indicators in cities”. As a 
consequence we conclude that an explanation for the observed 
super-linear scaling in productivity with increasing population 
density is the super-linear scaling of information flow within the 
social network. 


Population level variables. While in most cases it is not possible 
to obtain the social-tie density of a city directly, our model sug- 
gests that population density is strongly correlated with social-tie 
density across cities with similar transportation infrastructure and 
economic situations (that is, similar r,,,,). Therefore, we here 
explore social-tie density indirectly by using population density 
measures, and we only focus on horizontal comparison of cities 
of similar levels of economic development, such as US cities and 
European Union cities. 
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As a test case for our hypothesis, we study the prevalence of 
AIDS/HIV infections in cities in the United States. In Fig. 4, we 
plot the prevalence of AIDS/HIV in 90 metropolitan areas in 
2008 (ref. 40) as a function of population density. As the figure 
indicates, there is fairly good agreement between the data and the 
curve generated by our model of diffusion using both the simple 
and complex contagion models. 

The same agreement holds for European cities on economic 
indicators. In Fig. 5, we plot the overall GDP per square km in 
NUST-2 (Nomenclature of Territorial Units for Statistics level-2) 
regions in the European Union as a function of population 
density p, as well as population size. The NUST-2 regions are 
defined by the European Union as the city-size level territorial 
partition for census and statistics purposes*!. We find a strong 
positive correlation between density and the corresponding urban 
metric with a super-linear scaling component, but conversely a 
much weak and sublinear growth pattern on raw population size. 
While it is not the main focus of this paper, we show that the 
super-linear growth on density can be often be indicated in data 
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Figure 4 | Spreading rate of HIV as a function of density in US Metropolitan Statistical Areas. The relationship between density and AIDS/HIV 
spreading rate of the 90 metropolitan statistical areas from recent Centers for Disease Control and Prevention and US Census surveys. As is visible, the 


model captures the qualitative trends in the data. 


i) 

7 * Data 

& /— += Power-law f=1.26, R?=0.82 * 
3 Our model, R?=0.82 

(o) 8 

E 10 

2 

z 

© 107 

oO 

= 

a 

5 10° « 

ao 

ra) 

© 10° rn 

a 

ie) . . 
ke 


100 1,000 10,000 
Population density per square kilometer (x10°) 


* Data ¥ * 
-— + = Power-law f=0.85, R?=0.51 & 

108 se“ 
S " 
p= | 
wi 
& . 
S107 um 
g & 
g a sek 
e ae 

2 
10° oe 
100 1,000 10,000 


Population (x103) 


Figure 5 | Correlation between GDP and population, as well as correlation between GDP and population density for all 247 NUST-2 regions 

in the European Union. Left panel: correlation between density and GDP, suggesting a strong correlation with a super-linear functional form as predicted 
by the model. A pure power-law fit to the data is also shown for illustrative purposes. Right panel: the correlation between population and GDP this 
time showing a sublinear functional form. However, the poor R2-value suggests that raw population does not correlate, as well as density with GDP 


growth in cities. 
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as super-linear growth on population, and that density is a better 
indicator for socio-economic growth than population—see 
Supplementary Note 4. 

Note that in both data sets the scaling exponents are restricted 
within a narrow band 1.1<f/<1.3, potentially suggesting a 
common mechanism behind both the prevalence of AIDS/HIV 
and scaling of GDP with respect to the population density. 
An advantage afforded by our model is the need to dispense 
with parameter tuning, as the model naturally produces this 
scaling within a reasonable margin of error. Thus, by considering 
social structure and information/disease flow as a major driving 
force in many of the city indicators, our approach provides 
a unique and general theory to the super scaling phenomena 
of cities. 

Both the spreading of information (potentially leading to 
increase productivity and innovation) and contagious 
diseases rely on the mechanism of social interactions. However, 
while information can mediate via Mass Media (exogenous) 
influence, and/or endogenous (word-of-mouth) processes, 
we chose to highlight the AIDS/HIV spreading data to validate 
our model, as an example of a purely endogenous process. 


Discussion 

In this paper, we propose social-tie density (the density of active 
social ties between city residents) as a key determinant behind the 
global social structure and flow of information between 
individuals. Based on this, we have described an empirically 
grounded generative model of social-tie density to account for the 
observed scaling behaviour of city indicators as a function of 
population density. Our model accurately explains how urban 
density drives the super-linear growth of social interaction 
density*>, and eventually the super-linear growth of 
productivity as observed in many empirical data sets. 

The conceptual distinction between density and social density 
is an important one, as it has been missing in popular accounts 
that may percolate to urban planners and policy-makers. As a 
matter of fact, throughout the 20th century, the United States has 
witnessed major shifts from dense to suburban, then back to 
dense urban planning. As recently as 1970, suburban populations 
surpassed the urban one—motivated by a search for an idealized 
small, low-density, locally oriented community*”. Indeed, these 
back and forth shifts may have been facilitated by an incomplete 
understanding of the benefits afforded by urban density. 

The model predicts that social-tie density scales super-linearly 
with population density, while naturally accounting for the 
narrow band of scaling exponents empirically observed across 
multiple features and different geographies. We note that this is 
achieved without the need to resort to parameter tuning or 
assumptions about heterogeneity, modularity, social hierarchies, 
specialization or similar social constructs. We, therefore, suggest 
that population density, rather than population size per se, is at 
the root of the extraordinary nature of urban centres. As a single 
example, metropolitan Tokyo has roughly the same population 
as Siberia while showing remarkable variance in criminal 
profile, energy usage and economic productivity. We provide 
empirical evidence based on studies of indicators in European 
and American cities (both categories representing comparable 
economic development), demonstrating that density is a superior 
metric than population size in explaining various urban 
indicators. 

We note that current technology makes remote communica- 
tion and collaboration extremely easy and convenient; however, 
the importance of packing people physically close to each other is 
still widely emphasized*?-*9. We postulate that cities potentially 
operate under the same principle-as a consequence of 


proximity and easy face-to-face access between individuals— 
communication and ultimately productivity is greatly enhanced. 
Thus, though it is reasonable to surmise that individuals 
migrate to the city for reasons connected with individual 
needs and preferences, our argument suggests that, it is the 
benefits afforded by social-tie density that maintains them 
as residents. 

While our model provides a fundamental first-principles basis 
for explaining productivity of cities, we note the importance of 
higher-order variables such as transportation infrastructure in 
order to tailor the model to specific cases to get better results. 
As an example, the density of social ties is intrinsically a function 
of the ease of access between residents living in the same city. 
Consider the case of Beijing, which has a very high population 
density, but due to its traffic jams, is currently de-facto divided 
into many smaller cities with limited transportation capacities 
between them. Consequently, it may not demonstrate a higher 
social-tie density than other cities with a much lower population 
density. Thus a direct comparison of the model predictions with a 
similarly dense area such as Manhattan needs to take into account 
this refinement. In keeping with the spirit of the simplicity and 
bottom-up approach of our model, we chose to use data from 
cities within the United States and the European Union such that 
extraneous variables are controlled for. 

A number of theories of urban growth suggest the importance 
of specialist service industries, or high-value-add workers, as 
generative models of city development. While our model does not 
disprove these theories, it provides a plausible and empirically 
grounded model that does not require the presence of these 
special social structures. The other theories must, therefore, 
appeal to different sorts of data in order to support their claims. 
Cities are one of most exceptional and enduring of human 
inventions. Most great cities are exceptions in their own right: a 
New Yorker feels out of place in Los Angeles, Paris or Shanghai. 
However, this exceptionalism may be more due to our attention 
to human-scale details than to the underlying structures. In this 
paper, we have presented a generative theory that accounts for 
observed scaling in urban growth as a function of social-tie 
density and the diffusion of information across those ties. It is our 
hope that this provides both a foundation for the commonalities 
across all cities and a beginning point for which divergence 
between specific cities can be explored. 


Methods 

Data sets. All data sets used for analysis in this paper are publicly available. 
We collected data from the official websites of US Centre for Disease 
Control and Prevention, US Census Bureau and the Statistical Office of 

the European Union. The detailed information for each data set is provided 
in the paper. 


Diffusion models. Assuming that the spread of information and disease are 
archetypes of simple contagions, for the simple exposure contagion case, we run 
the SI model?”8 on networks generated by our model, and measure the speed 
at which the infection reaches a finite fraction of the population. We start by 
generating networks according to the process described in the introduction and 
then randomly pick 1% of the nodes as seeds (that is, initial infected nodes). 
The probability of an infection at a given time-step, to spread from an infected to a 
susceptible node, is denoted as ¢, which we fix to be €= 1 x 10~*. The simulation 
terminates at the point when 10% of the population is in the infected state. 
The networks generated are snapshots at different densities p and as before we 
vary the size of the grid N. 

For the complex contagion case, we adopt the complex contagion model*?. 
We assume that 10% of the population follows a simple contagion process: 
an individual can be infected by a single infected neighbour; the remaining 
90% of the population follows a complex contagion process: an individual is 
infected, if at least two of its neighbours are infected. The rest of the simulation 
is identical to the simulation with the SI model, and we measure the time steps 
required to infect 10% of the population. 
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Denoting S(p) as the number of time steps taken to infect 10% of the 


population, the mean spreading rate R(~) shown in Fig. 3 is computed using: 


R(p) = a- (6) 


Assuming that the mean spreading rate is proportional to the network density 


(that is, R(p) oc T(p)), we also fit the data to the form 
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Strong contributions of local background climate to 


urban heat islands 


Lei Zhao”, Xuhui Lee, Ronald B. Smith? & Keith Oleson* 


The urban heat island (UHI), acommon phenomenon in which sur- 
face temperatures are higher in urban areas than in surrounding 
rural areas, represents one of the most significant human-induced 
changes to Earth’s surface climate’. Even though they are localized 
hotspots in the landscape, UHIs have a profound impact on the 
lives of urban residents, who comprise more than half of the world’s 
population’. A barrier to UHI mitigation is the lack of quantitative 
attribution of the various contributions to UHI intensity* (expressed 
as the temperature difference between urban and rural areas, AT). 
A common perception is that reduction in evaporative cooling in 
urban land is the dominant driver of AT (ref. 5). Here we use a cli- 
mate model to show that, for cities across North America, geographic 
variations in daytime AT are largely explained by variations in the 
efficiency with which urban and rural areas convect heat to the lower 
atmosphere. If urban areas are aerodynamically smoother than sur- 
rounding rural areas, urban heat dissipation is relatively less efficient 
and urban warming occurs (and vice versa). This convection effect 
depends on the local background climate, increasing daytime AT by 


Annual-mean daytime AT (K) 


3.0 + 0.3 kelvin (mean and standard error) in humid climates but 
decreasing A Tby 1.5 + 0.2 kelvin in dry climates. In the humid east- 
ern United States, there is evidence of higher A Tin drier years. These 
relationships imply that UHIs will exacerbate heatwave stress on human 
health in wet climates where high temperature effects are already 
compounded by high air humidity®’ and in drier years when pos- 
itive temperature anomalies may be reinforced by a precipitation- 
temperature feedback*. Our results support albedo management as 
a viable means of reducing AT on large scales”””. 

The conversion of natural land to urban land causes several notable 
perturbations to the Earth’s surface energy balance. Reduction of evap- 
orative cooling is generally thought to be the dominant factor contrib- 
uting to UHI. Anthropogenic heat release is an added energy input to 
the energy balance and should increase the surface temperature. Energy 
input by solar radiation will also increase if albedo is reduced in the 
process of land conversion. Buildings and other artificial materials can 
store more radiation energy in the daytime than can natural vegetation 
and soil; release of the stored energy at night contributes to night-time 


Figure 1 | Precipitation and population 
influences on MODIS-derived annual-mean 
UHI intensity. a, Map of daytime UHI (shown in 
K by symbol type/size). b, Dependence of daytime 
UHI on precipitation (r = 0.74, P< 0.001). 

c, Map of night-time UHI. d, Dependence of 
night-time UHI on population (r = 0.54, 
P<0.001). Red, green and blue symbols denote 
cities with annual mean precipitations less than 
500 mm, between 500 and 1,100 mm, and over 
1,100 mm, respectively. Lines in b and d are linear 
regression fits to the data. Parameter bounds for 
the regression slope are the 95% confidence 
interval. 
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UHI. Finally, energy redistribution through convection between the sur- 
face and the atmospheric boundary layer can either increase or reduce 
AT, depending on whether the efficiency of convection over urban land 
is suppressed or enhanced relative to that over adjacent rural land. 
Although these concepts have been known for some time’!’, a quant- 
itative understanding of their roles across different climate regimes 
remains elusive’. 

The climatic context can be understood by posing the following ques- 
tion in a thought experiment: if two cities are built identically in terms of 
morphological and anthropogenic aspects but in different climates, will 
they have the same AT? The answer depends on time of day according 
to observations of surface temperature by the NASA MODIS satellite. 
For 65 selected cities in North America, the annual-mean midnight AT 
(surface temperature of urban core pixels minus that of rural pixels) is 
positively correlated with the logarithm of population (correlation coef- 
ficient, r = 0.54; confidence level, P < 0.001; Fig. 1d), but is invariant 
with climate, showing a statistically insignificant correlation with pre- 
cipitation (r = 0.05, P = 0.70; Extended Data Fig. 1), solar radiation 
(r = 0.15, P > 0.20) and air temperature (r = 0.20, P > 0.10). However, 
the annual-mean midday AT is strongly correlated with precipitation 
(r = 0.74, P< 0.001; Fig. 1b) and has a weaker statistical dependence 
on population size than does the night-time AT (r = 0.27, P = 0.027; 
Extended Data Fig. 1). The night-time AT shows little spatial coherence 
(Fig. 1c), but the daytime AT has a discernible spatial pattern that follows 
precipitation gradients across the continent (Fig. 1a). Twenty-four of 
the cities are located in the humid southeast United States, which coin- 
cides roughly with the Koppen-Geiger temperate climate zone (Fig. 2a). 
Their daytime annual-mean AT is on average 3.9 K and is 3.3 K higher 
than that of the 15 cities in the dry region (Fig. 2d, e). By comparison, the 
night-time AT differs by 0.1 K between the two groups (P > 0.60; Fig. 2f, g). 
These results are in broad agreement with previous remote-sensing 
studies on UHI across biophysical and developmental gradients'*"””. 
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Figure 2 | Attribution of UHI intensity in three Képpen-Geiger climate 
zones. a, Map of climate zones: white, mild temperate/mesothermal climate; 
grey, continental /microthermal climate; dark grey, dry climate. b, d, e, Daytime 
values of MODIS and modelled AT and its component contributions in each 
of the three zones (see arrows). c, f, g, Night-time values in each of the three 
zones (see arrows). Green bars denote model-predicted AT and blue bars 
denote UHI intensity calculated as the sum of the component contributions. 
Error bars, 1 s.e. for each climate zone. 
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At first glance, the relationship with precipitation (Fig. 1b) seems con- 
sistent with the hypothesis that reduction in evaporative cooling in urban 
land is the main driver of daytime AT, because the denser vegetation in 
wet climate regions has a higher evaporation rate than the vegetation 
in dry climates. However, our model-based analysis does not support 
such an interpretation. In the model domain, AT is a perturbation signal 
to the surface temperature caused by biophysical contrast between rural 
and urban land units in the same model grid cell’*. This signal is further 
decomposed, using the method described in ref. 19, into contributions 
from changes in radiation balance, evaporation, convection efficiency 
and heat storage, and from anthropogenic heat addition (Fig. 2). The 
credibility of the model is supported by the reasonable agreement of 
the modelled AT with the MODIS AT (r = 0.31, P < 0.02 for daytime; 
r = 0.30, P< 0.025 for night time) and by its accurate depiction of the 
relationship between night-time AT and albedo (Extended Data Fig. 4). 
Furthermore, the model has reproduced the observed positive correla- 
tion between the daytime AT and precipitation (Fig. 3a). 
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Figure 3 | Relationship between model-predicted daytime AT and 
precipitation among the cities. a, Correlation of AT and the individual 
biophysical components with annual-mean precipitation. Lines are linear 
regression fits to the corresponding data. Parameter bounds for the regression 
slope are the 95% confidence interval. b, AT-precipitation covariance 
explained by different biophysical factors. Note that the covariance explained 
by the anthropogenic heat term is negligibly small. 
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We find that it is the changes in convection efficiency (associated 
with aerodynamic resistance changes), rather than those in evapotran- 
spiration, that control the daytime AT-precipitation spatial covariance 
among the cities (Fig. 3). In the humid climate (the Képpen-Geiger 
temperate climate zone), convection is less efficient at dissipating heat 
from urban land than from rural land, and the associated temperature 
increase is 3.0 + 0.3 K, which dominates the overall AT (Fig. 2e). At 
these locations, the rural land is in general densely vegetated, owing to 
ample precipitation, and is aerodynamically rough. Quantitatively, this 
difference is manifested in a lower aerodynamic resistance to sensible 
heat diffusion in the rural land (39sm_') than in the urban land 
(62sm_'). Measured in terms of aerodynamic resistance, urbaniza- 
tion has reduced the convection efficiency by 58%. 

The opposite occurs in the dry climate zone, where urban land is 
rougher than rural land and has enhanced convection efficiency. The 
result is actually a cooling effect (Fig. 2d). In this zone, the urban land- 
scape has lower aerodynamic resistance (53sm_') than the adjacent 
rural land (66 sm’ '), which is typically inhabited by vegetation of low 
stature such as shrubs, sagebrushes and grasses. On average, the urban 
land is about 20% more efficient in removing heat from the surface by 
convection than is the rural land. The average cooling signal is — 1.5 + 
0.2 K. In a few of the cities, convection is much more efficient than in 
the surrounding natural land, such that AT becomes negative (Figs la 
and 3a). It has been suggested that negative AT, a phenomenon known 
as ‘urban heat sink’, arises from evaporative cooling of trees and lawns 
planted in the city'*"’”. Our explanation seems more logical, because the 
MODIS urban temperature comes from the urban core pixels with neg- 
ligible amounts of vegetation cover (enhanced vegetation index, <0.18) 
and the urban land unit in the climate model is completely free of veg- 
etation. An analogous situation exists in a semi-arid plantation forest 
where trees serve as efficient ‘heat convectors’, leading to a lower sur- 
face temperature than in the adjacent smoother shrub land”. 

At night, release of the stored heat is the dominant contributor to 
AT across all three climate zones (Fig. 2c, f and g). The dependence on 
population size (Fig. 1d), which is an indicator of the city’s horizontal 
dimension, can be understood in light of these results. At night, the re- 
leased heat is trapped in a very shallow atmospheric boundary layer. As 


air moves across the urban land, it will accumulate more heat with in- 
creasing travel distance. Having a longer upwind fetch, that is, a longer 
distance between the upwind edge of the city and the point of obser- 
vation, the centre ofa larger city should experience stronger warming’. 

There is some evidence of precipitation control on interannual vari- 
ability in the daytime AT for individual cities. For each city, we have 
calculated the linear regression slope of the annual daytime AT against 
the annual precipitation, and we refer to it as the temporal sensitivity to 
precipitation. Both the MODIS and the model data show a negative 
dependence of the sensitivity on site mean precipitation (Fig. 4b, d). 
Twenty-four cities have annual-mean precipitation exceeding 1,100 mm. 
According to the model, all of these cities, which are mostly distributed 
in the eastern United States, have negative temporal sensitivity (Fig. 4c), 
meaning higher AT in drier years. The mean temporal sensitivity of this 
group of cities is —0.0021 Kmm '. The MODIS results are less con- 
sistent because of shorter data records, showing negative sensitivity for 
42% of them (Fig. 4a, b). 

To gain further insight into the interannual variability, we have com- 
pared the daytime AT-precipitation correlations for Billings in Mon- 
tana (annual-mean precipitation, 353 mm) and Richmond in Virginia 
(1,183 mm). We choose these two cities because they have nearly the 
same morphological and biophysical specifications (Extended Data 
Table 1) and therefore are essentially identical in the model world. The 
sensitivity to precipitation is positive at Billings and negative at Rich- 
mond (Extended Data Fig. 2). In contrast to the spatial variations across 
North America (Fig. 3b), the AT interannual variability shown here is 
driven primarily by changes in surface evaporation (Extended Data Fig. 3). 

Our results can be interpreted in the context of heatwave climatology. 
A measure of heatwave intensity is the degree of deviation, in multiples 
of standard deviation (North American mean value, o ~ 0.6 K) of sum- 
mertime temperature from the climatological mean”". For example, the 
2003 European heatwave’ is a rare event measured at 5c. These stat- 
istical considerations are based on regional background climatology. 
Being an additional anomaly on this background condition, UHI will 
aggravate heat stress on human health. In the southeast United States, 
where the heat stress is already amplified by high air humidity’, the 
daytime AT is equivalent to 7a (Fig. 2e). The situation may be further 


Figure 4 | Temporal sensitivity of UHI intensity 
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to precipitation. a, c, Map of the temporal 
sensitivities (shown in Kmm_' by symbol size) 
according to MODIS (a) and the climate model (c). 
b, d, Dependence of MODIS (b) and model- 
predicted (d) temporal sensitivity on annual 
mean precipitation. The outlier city in the MODIS 
panels is Whitehorse in Yukon. The four outlier 
cities in the model panels are Boise and Nampa in 
Idaho, Winnipeg in Manitoba and Calgary in 

‘ Alberta. Lines in b and d are linear regression fits 
to the data. Parameter bounds for the regression 
slope are the 95% confidence interval. 
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worsened in drier years when the positive temperature anomaly is likely 
to increase owing to a precipitation-temperature feedback*. Empirical 
evidence exists for such synergistic effects”. Using the temporal sens- 
itivity of 0.0021 K mm” ',a 500 mm reduction in the annual precip- 
itation corresponds to an increase in the daytime AT by 1.1 K, or ~2o. 
We caution that these numbers represent the upper bound of the UHI- 
added stress because UHI intensity at the screen height” (the height of 
air temperature observation at a standard weather station) and under 
all-sky conditions should be smaller than our AT, which is for clear 
skies and for the surface. However, summertime AT is generally larger 
than annual AT (refs 13-17, 24). 

The health impact of heatwaves is one factor that motivates the 
growing efforts to mitigate UHI. According to our results, a strategy 
that focuses on reducing anthropogenic heat would bring virtually no 
relief, but this might be because of the primitive anthropogenic heat 
scheme in the model’*. Managing the convection efficiency or heat stor- 
age of urban land does not seem viable, even though these are large con- 
tributors to AT, because it would require fundamental changes to the 
urban morphology, such as a city-wide increase in building height. How- 
ever, efforts to increase urban albedo have the promise of producing 
measurable results on a large scale. For the cities in the southern United 
States, the reduction of net radiation loading amounts to a daytime cool- 
ing effect of 0.7 K (Fig. 2e). In the model, this reduction is caused by the 
fact that these cities have an average albedo that is 0.06 higher than the 
surrounding rural land. This albedo difference is modest, considering 
that phasing in reflective roofs in Chicago” has already increased the 
city-wide albedo by ~0.02 and that some cool-roof implementations” 
aim to increase the urban-rural albedo contrast by as much as 0.6. Albedo 
increases have little direct effect on the night-time UHI (Fig. 2g) but 
may have an indirect cooling benefit through the reduction in the day- 
time heat storage and, therefore, less heat release from storage at night’*”°””. 
The negative correlation between the night-time AT and urban-rural 
albedo contrast'® (Extended Data Fig. 4) can be viewed as empirical evi- 
dence of this indirect benefit. 


METHODS SUMMARY 

MODIS data. We calculated the annual-mean AT using the MODIS-Aqua eight- 
day composite land surface temperature from 2003 to 2012. The night-time and 
daytime AT were determined at 1:30 and 13:30 local time, respectively. Data were 
collected at 65 cities distributed across the United States and Canada. For each city, 
we paired pixels in the centre of the city with those outside the city to determine AT. 
Climate model. We used the Community Earth System Model’* to simulate UHI. 
The model grid cell consists of urban and rural land units whose surface energy bal- 
ance variables are calculated using a single-layer urban surface parameterization 
anda standard land surface scheme, respectively'*”. We ran the model for 33 yr of 
simulation time from 1972 to 2004 after a 60 yr spin-up. The forcing data are an 
atmospheric reanalysis product validated against various observations”. The simu- 
lation was conducted at the finest resolution supported by the model (0.23° 
longitude X 0.31° latitude) to resolve individual cities. The surface skin temper- 
ature was determined from the emitted long-wave radiation. To be consistent with 
the MODIS observations, we used the modelled data at 1:00 and 13:00 local time 
and under clear-sky conditions to compute the annual AT and to perform the 
surface energy balance analysis. 

Attribution of UHI. Attribution of UHI is accomplished by a surface energy bal- 
ance analysis. The total AT is partitioned, using the method of ref. 19, into contri- 
butions from the differences, between the urban and rural land units, in surface 
radiation balance, convection efficiency, evapotranspiration and heat storage, and 
from anthropogenic heat addition. The perturbation to the radiation balance results 
mainly from albedo contrast and also includes a minor part associated with surface 
emissivity change. The analysis was done separately for daytime and night time 
using the relevant forcing and prognostic model variables. 


Online Content Methods, along with any additional Extended Data display items 


and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

MODIS LST, precipitation and population data. The UHI temperature differ- 
ence AT can be defined using shelter-derived air temperature or a satellite-derived 
radiative surface temperature. The former suffers from inhomogeneity in the urban 
landscape. The latter is a valuable spatial average, but is influenced by the emissivity 
of the surface. Neither exactly matches the human experience of UHI as an indi- 
vidual walks across the rural or urban landscape. For the purpose of comparing dif- 
ferent cities, the surface temperature approach is easier and more stable. Nichol et al. 
(ref. 31) showed that the correlation between the surface and screen-height air 
temperature can be weak on neighbourhood scales and improves considerably at 
the scale of urban-rural transition. 

MODIS Aqua land surface temperature (LST) data obtained at 65 cities in the 
United States and Canada were used in this analysis. This is an eight-day clear-sky 
composite data set. The spatial resolution is 1 km. The satellite overpass times are 
approximately 13:30 and 1:30 local time, which are close to the times of daily max- 
imum and minimum temperature, and the measurement therefore gives a better 
representation of the diurnal range of AT than does that of the other MODIS sat- 
ellite, Terra. According to the product quality control flag, the data we used have 
an average LST error less than or equal to 2 K. While selecting urban-rural paired 
pixels, we avoided the rural pixels that have large elevation differences and large 
latitude differences relative to the urban core. Specifically, the upper thresholds for 
elevation difference and latitude difference are 100 m and 0.1°. Nine urban pixels 
(3 X 3) were selected in the city centre, paired with 1-3 patches of 9-49 pixels each 
(3 X 3 to 7 X 7) in the surrounding rural land. Because of topographic and lati- 
tudinal limitations, the number of rural pixels varied (one patch for 15 cities, two 
patches for 41 cities and three patches for 9 cities). The magnitude of AT is insen- 
sitive to the number of urban-rural pixels. Fixing the number of urban and rural 
pixels for all the cities to one 3 X 3 patch of pixels altered AT by at most 0.6 K. All 
the pixels selected were validated by the MODIS land cover map and cross-checked 
against Google Earth. Rural pixels are classified in the MODIS land cover map as 
natural surfaces such as forests, grassland, cropland and bare soils. To avoid high 
bias of UHI, we excluded water pixels. Urban pixels are classified in the MODIS land 
cover map as urban and built-up surface. The resulting AT represents the difference 
between the city core and minimally developed land outside the city. The annual 
mean values were calculated based on the 10 yr time series of the MODIS LST 
(2003-2012). Linear gap filling was done for short periods of missing values to 
minimize the impact of missing data on the annual means. If there are more than 
three consecutive missing values, we excluded that year. 

Cities were chosen so that each state, province or territory was represented by at 
least one city, with the exception of four provinces anda territory in Canada (Nova 
Scotia, Prince Edward Island and Newfoundland and Labrador; Extended Data 
Table 2). The chosen cities are large enough to be resolved by the climate model, 
except for five small cities (Helena, Montana; Augusta, Maine; Whitehorse, Yukon; 
Yellowknife, Northwest Territories; Iqaluit, Nunavut). These cities span a popu- 
lation range of 7,000-379,300. In addition, we avoided the cities on hilly terrain. 

The US precipitation data were obtained from PRISM (PRISM Climate Group, 
Oregon State University; http://prism.oregonstate.edu). The precipitation data for 
cities in Canada were obtained from Environment Canada (http://climate.weather. 
gc.ca/). The PRISM data sets are elevation-corrected grid estimates of monthly, 
yearly and event-based climatic variables. The precipitation data for Canadian cities 
are station measurements. 

The population data were obtained from the US Census 2010 (http://quickfacts. 

census.gov) and Canada 2011 Census from Statistics Canada (http://www12.statcan. 
gc.ca/census-recensement/2011/dp-pd/index-eng.cfm). 
Climate model and simulation. We used NCAR’s climate model CESM”* (Com- 
munity Earth System Model) to simulate the UHI in the United States and in Canada. 
In this model system, the land surface processes are represented by the Community 
Land Model”? (CLM). We used CLM version 4.0. In CLM, the land surfaces are 
categorized into five land units: vegetation, glacier, wetland, urban and lake. Each 
grid cell can have one or more of these land units. The surface radiation and energy 
balance equations are solved separately for these land units, and the results are ag- 
gregated to yield grid cell means. Specifically, the urban land unit is modelled using 
a ‘canyon’ structure and consists of the following subsurfaces: roof, sunlit wall, shaded 
wall, and pervious (for example bare soil) and impervious (for example road, side- 
walk and car park) canyon floor. It should be noted here that there is potentially 
more evaporation from the pervious canyon floor in the urban land unit than from 
the comparable bare soil in the rural land, because all of the water in the bare soil 
column is available for evaporation in the urban land unit. The vegetated land unit 
corresponds to nonurban or rural land. This land unit may contain up to 15 dif- 
ferent plant functional types and bare soil. 

The model was run in the offline mode (uncoupled from an active atmospheric 
model). The urban and rural parameterizations in each grid cell were driven by the 
same atmospheric forcing. The atmospheric forcing data used in this study is a 


careful reconstruction of the climatology from 1972 to 2004”. It was derived from 
a combination of the NCEP-NCAR reanalysis”, observation-based analyses and 
observational records. Therefore, the data set has an improved accuracy compared 
with the NCEP-NCAR reanalysis. We ran the model for 33 yr from 1972 to 2004 
after a 60 yr spin-up. The simulation was conducted at the finest resolution as 
a standard model configuration supported by this version of the model (0.23° 
longitude X 0.31° latitude), to resolve individual cities. We note that even at this 
finest resolution the grid cell is still large enough that the total urban area in a grid 
cell can be a combination of several urban areas. The surface skin temperature was 
determined from the emitted long-wave radiation for each land unit, with an emis- 
sivity of 0.88 for the urban land unit and 0.96 for the vegetated land unit. The urban 
emissivity is the mean value of the weighted averages of the emissivity values of the 
urban subsurfaces prescribed in the model for the selected cities. The rural emis- 
sivity is the mean value of the weighted average of the vegetation and soil emissivity. 

To construct the UHI, urban and rural flux and state variables were extracted 
from the model output at the grid cells where the selected cities reside. CLM invokes 
the urban parameterization only if the urban area fraction exceeds a threshold of 
0.1%. Therefore, five small cities used in the MODIS data analysis (Helena, Mon- 
tana; Augusta, Maine; Whitehorse, Yukon; Yellowknife, Northwest Territories; 
Iqaluit, Nunavut) are neglected by the model. 

We included only the modelled data at 1:00 and 13:00 local time each day in this 
analysis; these times were selected to match closely the MODIS overpass times. To 
replicate the MODIS clear-sky conditions, we excluded cloudy days whose clear- 
ness index*’ was less than 0.5. We then converted the daily values into eight-day 
averages. The gap filling and processes of calculating annual means are the same as 
for the MODIS data. Under all-sky conditions, the modelled UHI intensity is on 
average 0.59 K lower during the daytime and 0.02 K lower at night than the clear- 
sky values, and the pattern regarding the component contributions remains un- 
changed from the clear-sky plot (Fig. 2). 

We note that the climate model cannot explicitly capture population depend- 
ence (Fig. 1). This is because population size is not a model parameter and the heat 
advection occurs at subgrid scales not resolved by its one-dimensional parameter- 
ization of land-atmosphere interactions. 

Attribution of UHI. We used a surface energy balance analysis to isolate the con- 
tribution to the model-predicted AT from each individual biophysical factor asso- 
ciated with urban land conversion. In this analysis, the rural land is regarded as the 
base state, and urbanization is a perturbation to this base state. The perturbation 
signal is denoted by A. For example, AT = T, - T,, where T, is urban surface tem- 
perature and T, is rural surface temperature within the same model grid cell. Follow- 
ing the method of ref. 19, the solution of the UHI intensity can be approximated by 
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where Tis the surface temperature, 4) = 1/4e0T” is the local climate sensitivity, fis 
the energy redistribution factor, R* is the apparent net radiation, pis the air density, 
C, is the specific heat of air at constant pressure, r, is the aerodynamic resistance to 
heat diffusion, f is the Bowen ratio, a is the surface albedo, K, is the incoming solar 
radiation, L 1 is the incoming long-wave radiation, cis the surface emissivity, ¢ is the 
Stefan-Boltzmann constant, T, is the air temperature at a reference height. In this 
analysis, we assume that r,, 8, Rn* and Q, and Qay are parameters associated with 
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the external perturbation (land use conversion) and are independent of T; the 
partial derivative of these variables can then be calculated. 

In equation (1), the terms on the right-hand side represent, in order from the 
first to the last, contributions from changes in radiation balance (term 1), aerody- 
namic resistance (term 2), Bowen ratio (term 3), and heat storage (term 4) and from 
anthropogenic heat addition (term 5). Because r, is the resistance to sensible or 
convection heat flux, term 2 is essentially a measure of change in the convection 
efficiency between urban and rural land. In an abstract sense, changes in f (term 3) 
can result from changes in sensible heat flux (H), latent heat flux (LE) or both (see 
equation (2), below). In the present context of partial differentiation, however, H 
and LE are not independent because the delta term Af, is evaluated with the net 
radiation Re and other valuables held constant. Thus, a reduction in f is accomp- 
lished by channelling more radiation energy to the surface latent heat flux, and it is 
appropriate to attribute term 3 to changes in surface evaporative cooling. 

The calculation was performed separately for 1:00 and 13:00 local time, with 
cloudy days omitted. Three sets of variables were used. The first set comes directly 
from the forcing data and includes precipitation, incoming solar radiation (K,), 
reference-height air temperature (T,; air temperature at 30 m above the surface), 
air pressure and downward long-wave radiation (L,). The second set has model- 
predicted variables, including reflected short-wave radiation (aK,), sensible heat 
flux (H), latent heat flux (LE), storage heat flux (Q,) and anthropogenic heat flux 
(Qau). The third set of variables, including surface temperature (T), air density (p), 
Bowen ratio (f) and aerodynamic resistance (r,) were derived from the forcing data 
and the model-predicted variables. Specifically, the Bowen ratio was calculated as 


H 
= — 2 
p= 0) 
and the aerodynamic resistance to heat diffusion was calculated from 
C)(T —Ta 
_ oto (3) 
H 


The aerodynamic resistance determined from equation (3) is the sum of the dif- 
fusion resistance in the atmospheric surface layer and the excess resistance assoc- 
iated with the thermal roughness**”. The urban and rural land units within each 
model grid cell have the same forcing variables and have different values for the 
second and third sets of the variables. It should be noted here that the model un- 
derestimates the anthropogenic heat flux (Qay) owing to the primitive anthropo- 
genic heat scheme. The total anthropogenic heat in the model includes only heating 
and air conditioning (HAC) fluxes, waste heat generated by HAC and the heat 
removed by air conditioning. These fluxes are based on some prescribed parameters 
in the surface data set of CLM and calculated heat transfer into and out of roofs and 
walls. The heat flux due to traffic is neglected by the current version of the model’*. 

The sum of the component contributions is slightly lower than the modelled AT 
(Fig. 2) because high-order terms are ignored in the linearization of the surface 
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long-wave radiation term of the energy balance equation and nonlinear interactions 
among the factors are omitted in the analysis. Comparison between model-predicted 
AT and calculated AT (sum of the individual contributions) reveals excellent cor- 
relation for daytime (r = 0.88, P < 0.001) and night time (r = 0.55, P< 0.001). 
Covariance analysis. The covariance analysis was performed on modelled AT and 
its components against precipitation. Let Ca, Cy, Crp, C; and Cay be the contribu- 
tions from radiation, convection efficiency, evaporation, storage and anthropo- 
genic heat, respectively (terms 1 to 5 in equation (1)). Equation (1) can be rewritten 
as 


AT=Ca+Cyt+Cip+Cy+ Can +e 


where e is an error term arising from nonlinear interactions. Because the covar- 
iance operation is linear, the AT-precipitation covariance is equal to the sum of the 
covariance between each component and precipitation 


Cov(AT,P) = Cov(Cr,P) + Cov(Cy,P) 
+Cov(Cyz,P) + Cov(C, ,P) (4) 
+ Cov(Cau,P) + Cov(e,P) 


where P is precipitation. Equation (4) decomposes the total covariance between 
AT and precipitation into the covariance contribution from its five components 
and a residual error term. We presented covariance here rather than correlation 
coefficient because the correlation is not a linear operation. In Fig. 3 and Extended 
Data Fig. 3, we normalized the covariance between each component and precip- 
itation by the total AT-precipitation covariance. 

We applied this technique to the analysis of both spatial covariance and temporal 
covariance. In the analysis of spatial covariance, each data point is the climatic annual 
mean value of a city (Fig. 3). In the analysis of the temporal covariance at a city, each 
data point is the mean value for an individual year of that city (Extended Data Fig. 3). 
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Extended Data Figure 1 | Precipitation and population influences on 
MODIS-derived annual mean UHI intensity. a, Dependence of daytime UHI 
on population size (r = 0.27, P = 0.027). b, Dependence of night-time UHI on 
precipitation (r = 0.05, P = 0.70). Red, green and blue symbols denote cities 
with annual mean precipitations less than 500 mm, between 500 and 1,100 mm, 
and over 1,100 mm, respectively. The solid line in a is the linear regression 
fit to the data. Parameter bounds for the regression slope are the 95% 
confidence interval. 
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Extended Data Figure 2 | Time series of MODIS and model-predicted daytime AT and annual precipitation. a, Billings, Montana. b, Richmond, Virginia. 
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Extended Data Figure 3 | Relationship between interannual variations in 
model-predicted daytime AT and precipitation. a, Correlation of AT and the 
individual biophysical components with annual precipitation at Billings, 
Montana. b, Same as in a except for Richmond, Virginia. c, AT-precipitation 
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temporal covariance explained by different biophysical factors at Billings, 
Montana. d, Same as in c except for Richmond, Virginia. Lines are best linear 
regression fits to the data points. Parameter bounds for the regression slope are 
the 95% confidence interval. 
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Extended Data Figure 4 | Albedo influence on annual mean night-time UHI _ with all data points). The four outliers in the upper right corner of b are coastal 
intensity. a, Dependence of night-time MODIS-derived UHI on white-sky cities (Olympia, Washington; Seattle, Washington; Salem, Oregon; Vancouver, 
albedo difference (that is, urban albedo minus rural albedo; r = —0.60, British Columbia) that have high biases of the modelled AT compared to 
P<0.001). b, Dependence of night-time modelled UHI on modelled albedo the MODIS AT. Lines are linear regression fits to the data. Parameter bounds 
difference (r = —0.56, P<0.001 excluding four outliers; r= —0.18, P = 0.16 for the regression slope are the 95% confidence interval. 
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Extended Data Table 1 


Urban parameters of a city pair in CLM 
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City Billings 
State Montana 
Latitude (°) 37.53 45.79 
Longitude (°) -77.42 -108.54 
Canyon Height/Width 0.48 0.48 
Mean building height (m) 12 
Roof thickness (m) 0.15 
Wall thickness (m) 0.28 
Wind height in canyon (m) | 6 | 6 
Roof fraction 0.50 
Pervious road fraction | 0.66 | 0.64 
Emissivity (Impervious road) 0.91 
Emissivity (pervious road) 0.95 
Emissivity (roof) 0.65 
Emissivity (wall) 0.91 
Albedo (Impervious road) 0.13 
Albedo (pervious road) 0.08 0.08 
Albedo (roof) 0.30 0.30 
Albedo (wall) 0.34 0.34 
Roof thermal conductivity (Wm! K") 0.84 0.84 
Wall thermal conductivity (W m™ K"') 1.06 
Impervious road thermal conductivity (W m’! K”) 1.67 
Layers of impervious road 2 
Roof heat capacity (MJ m® K’!) 0.76 
Wall heat capacity (MJ m® K”') 0.81 
Impervious road heat capacity (MJ m? K"') 2.06 2.06 
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Extended Data Table 2 | Size statistics for selected cities in the United States and in Canada 


City size Cit City size 
City, State/Province | Population | in CLM State oe Population | in CLM 
(km’) (km*) 


Albany, NY Louisville, KY | 7.41E+05 
Albuquerque, NM 5.53E+05 25.24 Madison, WI 2.33E+05 2. 73 
Montgomery, AL_| 2.06E+05 
Nampa, ID 8.16E+04 


Bismarck, ND 6.13E+04 pega City, | sgog+0s | 10.72 
Boise, ID Esai 12.84 Olympia, WA ueceee 12.94 
Boston, MA 6.18E+0 86.42 | Philadelphia, PA | 1.53E+0 71.51 
Calgary, AB 1. ir 21.48 Phoenix, AZ, 1. ss 53.91 


Casper, WY Pierre, SD 0.89 
Cheyenne, WY : Portland, OR i 54.50 
sane nie 


Providence, RI 16.06 


Sonia OH Behnond, VA 8. 00 
Dallas, TX Sacramento, CA 36.27 
Denver, CO Saint John, NB 1.42 


Hartford, CT oa 25. 03 Saskatoon, SK Saas 13.07 
= Hitena it [298-00 [Na Seattle, WA | 6.216405 


Indianapolis, IN | 8.30E+05 | 37.56 Topeka, KS 1.27E+05 
Iqaluit, NU 6. OE +03 | NA | Toronto, ON re 2k +06 215.29 


Jackson, MS 15.76 Trenton, NJ 
Jefferson City, MO 3.54 Tucson, AZ, 
Lansing, MI 6.47 ‘VerGauyer, BE 


Lincoln, NE Winnipeg, MB 
Little Rock, AR ‘ : Yellowknife, NT 
Los Angeles, CA 
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he Japanese government recently 

presented a confronting picture 

of the state of science, technology 
and innovation in the country. 

Ina 2015 report on its Fifth Science 
and Technology Basic Plan, Japan's 
Council for Science, Technology and 
Innovation conceded its world standing 
in science and technology was falling. 
Government R&D investment growth 
has also stagnated in the past 10 years 
compared to spending by the world’s 
other leading nations. “Our research 
papers are dropping in international 
rank, in quantitative and qualitative 
terms,’ the report said. “There have 
been delays in building an international 
network, and our science and 
technology activity is regrettably starting 
to fall behind the world leaders.” 

The Nature Index is a powerful tool to 
examine the nuances of this situation. 
While the database provides a broad 
overview of the country’s research 
performance in the natural sciences, it 
also offers insights into the high-quality 
output and collaboration trends of 
individual institutions. 

Japan is still a very strong scientific 
country with seven universities in 
the top 100 in the Nature Index. By 
some measures, the index reflects the 
government’ recent assessment of 
the diminished state of science and 
innovation, it also reveals that some of 
Japan's institutions have increased their 
output of high-quality science. 


NATURE INDEX 2016 JAPAN 

The Nature Index 2016 Japan, a supplement to Nature, is 
produced by Nature Publishing Group, a division of Macmillan 
Publishers Ltd. This publication is based on data from the 
Nature Index, a website maintained by Nature Publishing 
Group and made freely available at natureindex.com. 
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Japan’s overall output in the index has 
decreased by 12% since 2012, and the 
country is ranked in the database's top 
five countries. There is also evidence 
that the government’s agenda to 
globalize universities and encourage 
international partnerships is working. 
In the past year, connections with nine 
ofits top 10 collaborating countries 
have increased. 

In our first index supplement 
dedicated to Japan, we present the 
performance of individual institutions 
in three categories. 

On page S114 we profile the country’s 
top universities by weighted fractional 
count. These institutions are publishing 
the largest portion of Japan's best 
research. We look at the country’s 
rising stars on page S118. These are the 
institutions with the biggest growth in 
high-quality output between 2012 and 
2015. Some are small or new, but their 
improvement surpasses many older and 
better-resourced universities. 

Japan’s institutions have improved 
their collaborative reach and the article 
on page S127 looks at some institutions 
whose research partnerships have 
resulted in the most publications in 
high-impact journals, measured bya 
metric known as collaboration score. For 
more information on how Nature Index 
metrics are calculated, see page S136. 


Nicky Phillips 
Editor, Nature Index 
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A NETWORK OF KNOWLEDGE 


Research breakthroughs are a global pursuit, and while there is healthy rivalry between 
nations, Japan and its strongest competitors are increasingly joining forces. 


DATA ANALYSIS BY LARISSA KOGLECK 


COLLABORATIONS 


Japan retained its position as one of the top 10 countries in the Nature Index 
for its collaborations with domestic and international institutions in 2015. 
‘The scope of Japan’s partnerships grew between 2014 and 2015, after 
remaining fairly static since 2012. This phenomenon is not specific to Japan. 
With the exception of China, which has been growing consistently, most 
countries showed little movement in their overall output of collaborations 
between 2012 and 2014. Collaboration score is a measure of the contribution 


of authors from different institutions to every paper included in the index. ae STATES 
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Germany has 
experienced 
solid growth 
since 2012. 


@ JAPAN 
After three years of 
slow growth, Japan's 
collaborations 
increased in 2015. 
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UNITED STATES 


As Japan's top collaborator in the 
ature Index, it is little surprise that 
the USA is also the country with 
which it has shared the most science 

obel Prizes. The two countries have 
been joint recipients of six Nobels 
since 2000, most in chemistry. In 
2010 Japanese chemists, Akira 
Suzuki and Ei-ichi Negishi, the latter 
having spent most of his career in the 
USA, shared the chemistry prize with 
the American, Richard Heck, for their 
work synthesizing complex carbon 
molecules widely used in drug 
manufacturing. 
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UNITED KINGDOM 


The two science 
Nobels that Japan 
has shared with the 
UK since 2000 — 
in medicine or 
physiology in 2012 
and 2015 — paint 
a picture of 
collaboration 
between the two. 
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SWITZERLAND 


In 2002 Japan received the 
physics prize and chemist, 
Koichi Tanaka, shared the 
chemistry prize with 
Switzerland’s Kurt Wuthrich 
and the American John 
Fenn for their development 
of methods for identifying 
and analysing biological 
macromolecules. 


ITALY 


The single prize shared 
between Japan and Italy 
was for discoveries in 
astrophysics. The Italian 
Riccardo Giacconi and 
Japan’s Masatoshi Koshiba 
were awarded the physics 
prize in 2002, principally 
for their effort in detecting 
cosmic neutrinos. 


SOUTH KOREA 
CS: 149 


AUSTRALIA 
CS: 106 


JAPAN 


Japan boasts the second 
most Nobel Prize 
Laureates in the natural 
sciences in the 21st 
century. Since 2000 15 
Japanese scientists have 
been awarded science 
Nobels — eight in 
physics, five in chemistry 
and two in physiology or 
medicine — and ten 
prizes have been shared 
with scientists from 
countries that represent 
some of Japan's strongest 
collaborations in the 
Nature Index. 
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CHINA 


In 2015 Japan and China 
shared their first Nobel Prize. 
Japanese biochemist, Satoshi 
Omura, along with his Chinese 
counterpart, Tu Youyou, were 
two of three scientists awarded 
the medicine prize for their 
discoveries concerning therapies 
for malaria or infections caused 
by roundworms. 
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Researchers working on electrochemical energy conversion search for fuel cell breakthroughs at the I?CNER at Kyushu University. 


FLAGGING FORTUNES INVITE REFORM 


Japan’s quest to retain its status as a global research leader by enticing foreign students 
and faculty is the right response, say observers, but efforts are yet to have animpact. 


BY ICHIKO FUYUNO 


he landlocked former Soviet republic 

| of Uzbekistan seems an odd choice of 

location for a Japanese education fair, 

but, in November 2015, representatives from 

seven leading Japanese universities travelled 

to Tashkent, the Uzbek capital, for their fifth 
annual event. 

Presented by Nagoya University, it drew hun- 
dreds of local students interested in the opportu- 
nity to study at prestigious Japanese institutions 
without having to speak the language. 

About 100 Uzbek students come to Japan 
every year, including about 40 to Nagoya, 
but Yoshihito Watanabe, the vice president of 
Nagoya University, would like to attract more. 

“Their interest in Japanese education is 
probably the highest among other countries 
where we organize similar events,’ says Wata- 
nabe, who is in charge of international educa- 
tion at his university. 
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From France and the United Kingdom to 
China, India and Thailand, Nagoya University 
and other leading Japanese universities hold 
similar promotional events around the world. 
They aim to strengthen communication with 
education officials and local students, and 
entice them to study in Japan. 

This strategy responds partly to a sense 
among the Japanese research community that 
its research output and impact has stagnated in 
recent years. While Japan ranked fifth in the 
Nature Index in 2015, the country’s perfor- 
mance in the database 
of the world’s best nat- 
ural science research 
is slipping. Since 
2012, the proportion 
of Japanese contri- 
butions to papers in 
68 high-impact jour- 
nals included in the 
index — its weighted 
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fractional count (WFC) — has fallen 12% (see 
Top 10 Index performers). The performances of 
the United States and France have also declined. 

After decades of keeping universities insular, 
Japan has realized that diversity and collabora- 
tion are essential to producing world-class sci- 
ence and research. To revive Japan's academic 
competitiveness, the Ministry of Education, 
Culture, Sports, Science and Technology 
(MEXT) has been pushing its higher education 
sector to attract more foreign staff and students. 
Simultaneously, it has been implementing 
reforms to make its research environment more 
competitive with the best research universities 
in the world. “Internationalization changes the 
internal structure of universities, which makes 
them more flexible and on par with global 
norms,’ says Petros Sofronis, director of the 
WPI (World Premier International Research 
Center Initiative) International Institute for 
Carbon-Neutral Energy Research (I°CNER) 
at Kyushu University. 


I2CNER 


“The Japanese government is doing the right 
thing,” says Sofronis. 

But many policy analysts and senior 
researchers say these efforts aren't enough. 
They say more reforms are needed to allow 
research institutions and universities to make 
decisions about their own structure. “Tf the 
government is really determined to raise the 
level of globalization, current strategies are 
insufficient,” says Hiroshi Nagano, visiting 
fellow at the Japan Science and Technology 
Agency, who is a specialist in international 
science polices. 


DECLINING TREND 

For more than a decade, the Japanese govern- 
ment has tightened its annual budget for sci- 
ence amid the country’s weak economy and 
growing social welfare demands. The 2016 
budget for science and technology is set at 3.45 
trillion yen, down 6.4% from its peak in 2012. 

The government’s R&D investment 
accounted for 19.5% of the country’s total 
R&D spend in 2013, the lowest among major 
countries including China, South Korea and 
the United States. 

The president of Suzuka University of Medi- 
cal Sciences, Nagayasu Toyoda, who observes 
Japanese science policies closely, says the effect 
has been “deteriorated support for fundamen- 
tal research activities, the foundation of our 
country’s strength.” 

And while the Japanese government, since 
2005, has increased the value of competitive 
grants awarded to scientists for research pro- 
jects, it has slashed the key funding pool that 
national universities and research institutions 
rely on to make themselves more competitive, 
known as Management Expenses Grants. 

It’s not just within the Nature Index that 
Japan’s ranking has slipped. According to a 
recent report by the National Institute of Sci- 
ence and Technology Policy (NISTEP), the 
number of Japanese papers cited in the top 1% 
of all journal articles published between 2011 
and 2013 fell to seventh in the world, down 
from fourth a decade ago. 


OPENING UP 

The Japanese government's efforts to open up 
higher education are not new, but they have 
accelerated in the past few years. While the 
budget for science and technology has fallen, 
the money available for higher education insti- 
tutions to globalize has surged from 1.9 bil- 
lion yen in 2006 to 9.34 billion yen in 2016. 
Between 2009 and 2013, the MEXT established 
the Global 30 project, funding 13 leading uni- 
versities to create degree courses in English 
that would attract foreign students amid inten- 
sifying recruitment competition with other 
countries. As the number of courses taught in 
English at these universities grew from a hand- 
ful in 2009 to more than 300 now, the number 
of overall overseas students grew from 23,083 
in 2009 to more than 28,000 by 2013, including 


Prospective students head for an entrance examination at The University of Tokyo’s Hongo campus. 
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THE TOP 10 PERFORMERS IN THE NATURE INDEX IN 2015 


From 2012 to 2015, these 10 countries have contributed the most 
to the publications tracked by the Nature Index. The WFC of most 
nations has stagnated or declined. 


20,000 


17,500 


15,000 


12,500 


— 


As with many other 
top-10 performers, 
the US has seen a 
continued downwards 
trend in its output in 
the Nature Index over 
the years. 


China has continued to 
raise its output in the 
Nature Index, slowly 
widening the gap 
between itself and the 
rest of the pack. 
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United States 
China 
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United Kingdom 

@ Japan 
France 

© Canada 

@ Switzerland 
South Korea 


@ Italy 


Weighted fractional count (WFC) 
is a metric that apportions credit 
for each article according to the 
affiliations of the contributing 
authors. Article count (AC) is the 
number of articles in the index 
that have at least one author from 
that country or institution. 


Ss: | 
Japan has 
experienced a 
steady decrease in 
its output in the 
Nature Index 


world, down from 
fourth in 2014, 
behind the US, 
China, Germany 
and the UK. 
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those enrolled at short-term programmes and 
other non-Global 30 courses. But this modest 
increase in international students cost Japa- 
nese taxpayers 14.7 billion yen over five years, 
and by 2013 the country’s percentage of for- 
eign students, 3.4%, remained well below the 
OECD average of 9%. The increase in foreign 
students is also yet to be reflected in the out- 
put of high-quality research for most Global 30 
participants (see below). 


NEXT STEP 
While the Global 30 project succeeded in 
increasing the number of university courses 
offered in English, the MEXT launched 
another programme in 2014 to catapult more 
Japanese universities into the world’s top 100 
institutions. The Top Global University Pro- 
ject, often referred to as the Super Global 
University programme, funds 37 universi- 
ties to change their organizational structures, 
partly to deepen ties with international insti- 
tutes. Thirteen “Type A universities — many 
of which are among the top ten institutions 
in the Nature Index in Japan in 2015 — will 
receive up to 500 million yen a year until 
2023 to become a top 100 university in rank- 
ings such as the Times Higher Education list. 
Many Type A members are striving to create 
joint degrees with top-level overseas institu- 
tions. In April 2015, Tohoku University estab- 
lished the world’s first graduate programme 
in spintronics in collaboration with Johannes 
Gutenberg- Universitat Mainz. The spintronics 
course fits with Japan's strength in physical sci- 
ences, evidence of which is seen in the Nature 
Index (see Subject Split, $102). 

Another 24 “Type B’ institutions will be 


given up to 300 mil- “We have to 
lion yen touse their cregtenew 
eka autor i measures to 
contri ute to loca assess how 
acs ae ae , universities 
Perhaps Japan’s * 

bring 
most successful pro- re 

Cc ompetitive 
gramme to promote 1 
internationalization Personne 
is the WPI, which "to the global 
launchedin2007and S0Ctety. 


comprises nine insti- 
tutes. The official language of these centres 
is English and an average 40% of researchers 
are from abroad. WPI institutes accounted for 
4.63% of the world’s most highly cited papers 
between 2007 and 2013, the third largest contri- 
bution after the Rockefeller University and the 
Massachusetts Institute of Technology (MIT). 
But the executive director of RIKEN, 
Yoichiro Matsumoto, says there are few indica- 
tors to assess the success of these programmes. 
“The government evaluates the impact of pro- 
grammes like Global 30 only by numbers. But 
we have to create new measures to assess how 
universities bring competitive personnel into 
the global society,’ says Matsumoto. 


FUNDAMENTAL PROBLEMS 

Charles Yokoyama, director of research 
administration at the RIKEN’s Brain Science 
Institute (BSI), says that despite Japan’s efforts 
to attract overseas staff and students, it’s hard 
for the country to keep up. 

“Globalization activities are slower than 
they can or should be to keep up with global 
competition,” he says. The RIKEN BSI is one 
of the most international institutions in Japan 


GLOBAL 30 PROJECT UNIVERSITIES IN THE NATURE INDEX 


The MEXT Global 30 project funded 13 leading universities to create degree courses in English that would attract foreign students. 
The courses surged from a handful in 2009 to more than 300 now. 


The University of Tokyo (UTokyo) 
Kyoto University 
Osaka University 
Tohoku University 
Nagoya University 
Kyushu University 
Keio University 
University of Tsukuba 
Waseda University 
Doshisha University 
Ritsumeikan University 
Sophia University 

Meiji University 
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of the institutions 
in the Global 30 
project have also 
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its successor: 

the Top Global 
University 
programme. 
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yet only a few postdocs are trained, or awarded 
faculty positions at elite US universities. The 
pool of talented researchers largely bypasses 
Japan, says Yokoyama. 

Hiring excellent researchers is key to attract- 
ing top-level students from abroad, but “Japan 
can be no match for the United States and 
other major countries so far in terms of the 
level of salaries and welfare packages for fami- 
lies” adds Takaaki Kajita, director of the Insti- 
tute for Cosmic Ray Research at the University 
of Tokyo. 

While Japan may struggle to compete with 
other nations in terms of recruitment, Japanese 
researchers are reaching out to partner with 
their global peers. Between 2014 and 2015, 
Japan’s collaborations with China, the United 
Kingdom, France and Germany grew. Its part- 
nerships with the United States also increased, 
an upsurge to a three-year decline (see Japan’s 
top partnerships). 

Shizuo Akira, director of the WPI Immu- 
nology Frontier Research Center at Osaka Uni- 
versity, says many young Japanese researchers 
are unwilling to move labs or go abroad for 
postdoc positions and miss opportunities to 
learn from different leaders. Akira, one of the 
world’s most cited immunologists, says young 
Japanese researchers are reluctant to move as 
competition for positions in Japan is intense 
after returning from abroad. “Unless we estab- 
lish a practical career path that presents young 
fellows as visibly prominent scientists on the 
global stage, there is little hope for the future 
of Japanese science.” 

The government is aware of these issues 
and is implementing various measures to 
strengthen ties with overseas collaborators. 


i 


LEADING THE PACK 


The University of 
Tokyo (UTokyo) 
contributes 
more than 30% 
of the Global 30 
project's WFC 
output in the 
Nature Index in 
2015. 


350 400 450 500 


The MEXT is in the process of revising Japan's 
biggest competitive grant, the Grants-in- 
Aid for Scientific Research, so it can provide 
greater support for joint international research. 
In addition, the ministry set aside 5.6 bil- 
lion yen this year to accelerate intellectual 
exchange, allowing about 520 young Japanese 
researchers to work abroad for two years, and 
1,100 overseas researchers to come to Japan. 

But Nagano worries these measures are 
relatively small due to the government's tight 
science budget, and many have not led to a rise 
in Japan’s rankings. “If Japan wants to raise its 
global ranking, we need to recruit good stu- 
dents. To do that, we need good faculty. But 
there are no measures that combine these two 
points,” he says. 

Meanwhile, Yokoyama observes that Japan’s 
traditional solutions to globalize research often 
focus on the recruitment of international visi- 
tors such as students and junior faculty, but 
more basic reforms of domestic graduate 
programmes are needed for Japan to develop 
a globally competitive STEM (science, technol- 
ogy, engineering and mathematics) workforce. 

“The level, approach and pace of globali- 
zation of Japan’s graduate programmes have 
fallen short of elite global standards including 
the key issue of training in high-level English 
communication,” he says. 

Atsushi Sunami, the government's advisor 
for science policies at the National Gradu- 
ate Institute for Policy Studies, says compre- 
hensive reforms of national universities are 
currently underway to reorganize faculties, 
strengthen governance and review funding 
and personnel systems, but their effect will 
take time. m 
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a ~ eT ce fe 4 
Top: Students work on a solution at the Earth Life Science Institute, a designated WPI centre. 
Bottom: A system for testing the properties of spintronics components at Tohoku University. 


Japan's most prolific collaboration partnerships are with the United States, Germany and China. The largest collaboration 
with the United States has seen a slight decline, while partnerships with Germany and China have grown. 
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TOKYO TECH 


TOHOKU UNIVERSITY 


The Kamioka Gravitational Wave Detector (KAGRA), housed in a giant tunnel, is a project of University of Tokyo’s Institute for Cosmic Ray Research. 


EYE ON THE PRIZE, BUT GRASP LOOSENS 


Inmost of the past 15 years, Japanese scientists have beennamed Nobel Laureates for work in 
chemistry and physics, but illustrious awards may be harder to come by in the next era. 


BY TIM HORNYAK 


odern Japanese research is a tale 
of two eras. While the nation’s sci- 
entists have been recognized with 


Nobel Prizes for work in fields such as chem- 
istry and physics in the majority of the last 15 
years, some observers predict that such awards 
will be more elusive for the next 15. 

University of Tokyo physicist, Takaaki 
Kajita, the most recent laureate, was honoured, 
along with Canada’s Arthur B. McDonald, in 
2015 for showing that neutrinos can oscillate, 
demonstrating that they also have mass — a 
revelation that defied the standard model of 
particle physics. 

The achievement solidified the University 
of Tokyo’s dominant position over other Japa- 
nese research centres. It holds the country’s top 
spot in 2015 in the Nature Index, a database 
of the contributions scientists have made to 
68 high-quality science journals. The index 
tallies the annual number of scholarly papers 
affiliated with a particular institution, the rela- 
tive contribution of each author to a particu- 
lar study, known as the fractional count, and 
the weighted fractional count (WFC), which 
adjusts for the relative abundance of papers 
in astronomy and astrophysics. Between 2014 
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and 2015, the University of Tokyo’s WFC rose 
6%, the largest increase of any institution in the 
Nature Index top 10. 

Perhaps unsurprisingly, seven of the top 10 
Japanese institutions in the index form the 
state-backed National Seven Universities, a 
group analogous with the Ivy League schools 
in the United States. 

The top 10 carry out research supported by 
a combination of government funding and 
collaborations with the private sector, though 
the latter has been taking on an increasingly 
significant role. 

Despite the country’s impressive Nobel 
Prize cabinet, by other measures of success 
there may be cause for concern. Between 
2012 and 2015 the output of high-quality sci- 
ence counted in the Nature Index declined 

for eight out of Japan's 
top 10 institutions. 
This follows a wider 
trend within Japanese 
institutions where 
the overall number 
of papers published 
in five natural sci- 
ence disciplines has 
fallen in recent years, 
according to an 
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analysis by Nagayasu Toyoda, president of 
Suzuka University of Medical Sciences. 

“The number of scientific articles in Japan 
normalized by population or GDP is less 
than half that of many OECD countries,” says 
Toyoda, who anticipates fewer Nobel Prizes 
for Japan. 

“The main reason for the decline in research 
activity at Japanese universities is budget cuts 
to basic subsidies to national universities by 
about 1% or more per year during the past 10 
years and beyond.” 

This will have an impact on Japanese sci- 
ence, as top researchers have historically been 
able to leverage their reputation to try to break 
new ground. For instance, Kajita has been 
working toward the direct detection ofa gravi- 
tational wave. Predicted by Einstein’s theory of 
relativity, gravitational waves can distort space- 
time in minute quantities — equivalent to the 
diameter of a hydrogen atom for the distance 
between the Earth and the Sun. 

The recent detection of gravitational waves 
could be the beginning ofa new kind of astron- 
omy. Kajita is supervising a project to build a 
3-km laser interferometer. “The beginning of 
the universe is an ultimate goal, but for the 
moment our hope is to observe the formation 
of black holes.” 
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TOP OF THE NATION 


Japan's top 10 overall performers in the Nature Index 
in 2015 include the National Seven Universities, 
government funded research institutions and 
up-and-coming research universities. 


UNIVERSITY OF TOKYO 
2015 WFC: 486 


KYOTO UNIVERSITY 
2015 WFC: 270 


OSAKA UNIVERSITY 
2015 WFC: 194 


TOHOKU UNIVERSITY 
2015 WFC: 176 


RIKEN 
2015 WFC: 137 


NAGOYA UNIVERSITY 
2015 WFC: 127 


TOKYO INSTITUTE OF TECHNOLOGY 
2015 WFC: 120 


HOKKAIDO UNIVERSITY 
2015 WFC: 103 


KYUSHU UNIVERSITY 
2015 WFC: 85 


See RIKEN profile 
on $128 
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NATIONAL INSTITUTE FOR MATERIAL SCIENCE 
2015 WFC: 82 


10 


WFC output from 2012 to 2015 for Japan's top 5 
universities in the Nature Index. 
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The University of Tokyo has been a stronghold of excellence in Japanese science since its establishment. 


UNIVERSITY OF TOKYO 
2015 WFC: 486 = 2015 AC: 1,377 
2015 NATURE INDEX ACADEMIC RANKING: 3 


Measuring the performance of University of Tokyo in a global context over time. The line graph below 
compares University of Tokyo (purple) with peers with a similar WFC output in the Nature Index 2015. 


The donut shows WFC output by subject area in 2015. 
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Founded in the decade after Japan’s emer- 
gence from feudalism in 1868, the University 
of Tokyo has long presided over the ranks of 
Japanese centres of learning. 


PHYSICAL 
SCIENCES 
2015 WFC: 216 


CHEMISTRY 
2015 WFC: 164 
y 


STRENGTH: 
© PHYSICAL SCIENCE 


LIFE SCIENCES 
2015 WFC: 107 


*Subjects may overlap. The sum of subject area WFCs 
may therefore exceed the institution’s overall WFC. 


EARTH & 
ENVIRONMENTAL 
2015 WFC: 32 


Promotion of Science (JSPS), more than half 

is used to support fundamental research, and 

16% goes to support young researchers. 
“Research today is ever more spe- 


Today, it comprises 10 faculties, “embers of cialized and increasingly requires 
15 graduate schools, 13 research tye University the efforts of teams working across 
centers and facilities located of Tokyo disciplinary boundaries,” says 
throughout Japan. Amongits gyre active Kazuo Hotate, executive vice presi- 
scholars are eight Nobel Prize gorge the dent for research at the university. 
winners and one Fields Medal full spectr “One of our great strengths is that 
. Oren te pectrum mae 
winner. To maintain its research of human members of the University of Tokyo 
output, the university relies on endeavour.” are active across the full spectrum 


research grants and collabora- 
tive research with industry for 
about one-third of its budget. Of its 23.5 bil- 
lion Grants-in-Aid for Scientific Research 
from the state-backed Japan Society for the 
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of human endeavour, providing 
immense opportunity for inter- 
disciplinary collaboration both within the 


university and with our partners across Japan 
and the world” 
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KYOTO UNIVERSITY 
2015 WFC: 270 = 2015 AC: 715 


2015 NATURE INDEX ACADEMIC RANKING: 14 
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Like the University of 


Tokyo, Kyoto Univer- 
sity has produced a 
number of outstand- 
ing researchers since 
its establishment in 
the late 19th century 
— nine have won a 
Nobel Prize, includ- 
ing Shinya Yamanaka 
in 2012 for discover- 
ing that mature cells 
can be reprogrammed 
into so-called induced 
pluripotent (iPS) 
cells (described in 
the journal Cell). 
Yamanaka wrote about 
streamlining iPS cell research in 
Nature in 2009. Today, Kyoto 
has 14 research institutes includ- 
ing the Center for iPS Research 
and Application (CiRA), where 
Yamanaka is director. Follow- 
ing partial privatization in 2004, 
along with all other national 
universities, government block 
funding has been decreasing. 
Kyoto University has responded 
by increasing 
joint projects 


“KU has with corporate 
tended to partners as well 
attract as taking on 
the best corporate-spon- 
minds in sored research, 
Japanese which now 
academia.” accounts for 
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*Subjects may overlap. The sum of 
subject area WFCs may therefore 
exceed the institution’s overall WFC. 


Neurons derived from iPS cells, whose discovery 
earned Shinya Yamanaka a Nobel Prize in 2012. 


about 42% of the research budget. 
An example of the embrace of 
industry is T-CiRA, a 10-year 
collaboration between CiRA and 
Takeda Pharmaceutical Com- 
pany, Japan’s largest drug maker, 
to develop clinical applications of 
iPS cells. “This system of innova- 
tion is seen as being in tune with 
the city of Kyoto, where tradition 
combines with creativity to result 
in an environment fostering ven- 
ture businesses,” says the univer- 
sity spokesman, David Hajime 
Kornhauser. “KU has tended to 
attract the best minds in Japanese 
academia: those who are seeking 
a place to pursue their ideas freely 
and express their creative spirit to 
the utmost”. 
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OSAKA UNIVERSITY 
2015 WFC: 193 = 2015 AC: 532 


2015 NATURE INDEX ACADEMIC RANKING: 34 


Osaka University is known for 
its immunology discoveries. 
It was at this institution that 
the emeritus professor, Tad- 
amitsu Kishimoto, identified the 
immune protein interleukin-6 
in the 1980s. Today the univer- 
sity is trying to clear a path to the 
next breakthroughs in the field 
through the Osaka University 
Immunology Frontier Research 
Center (IFReC). Opened in 2007 
as one of the Japanese govern- 
ment’s “globally visible” World 
Premier Institute Research Insti- 
tutes, IFReC aims to broaden the 
scope of immunology research to 
include study of cells in the body 
and come up with solutions to 
immune diseases including 
vaccine development. The staff 
of more than 20 researchers in 
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PHYSICAL 
SCIENCES 
2015 WFC: 48 


* Shizuo Akira, 
a leader in 
the field of 
immunology. 


immunology, bioinformatics and 
imaging have elucidated links 
between the immune system and 
metabolic syndromes including 
lipolysis and gout, described in 

2013. Direc- 
“In 2016,I tor, Shizuo 
expectrobust Akira, said 
achievements the centre 


inthese fields.” had pub- 
lished more 


than 1,000 papers on a wide vari- 
ety of immunology fields, almost 
10% of them in high-impact 
journals. Akira, who studied in 
Kishimoto’ lab, said: “Our labo- 
ratory is actively involved in the 
determination of new subsets of 
macrophage and regulation of 
post RNA transcription. In 2016, 
I expect robust achievements in 
these fields” 
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subject area WFCs may therefore 
exceed the institution’s overall WFC. 
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TOHOKU UNIVERSITY 
2015 WFC: 176 ©2015 AC: 429 


2015 NATURE INDEX ACADEMIC RANKING: 38 
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Hideo Ohno spent 10 years 
researching semiconductors in 
his hometown of Sapporo before 
he became curious about the pos- 
sibilities of combining magnets 
with these indispensable devices 
of the information age. In 1994 
he began to explore this in ear- 
nest, when he joined the faculty 
of Tohoku University’s Depart- 
ment of Electronic Engineering. 
Ohno is now the director of the 
Center for Spintronics Integrated 
Systems. “Spintronics is a broad 
science and engineering field, 
in which we utilize the ‘spin’ (a 
magnetic property) of the elec- 
tron,’ says Ohno. Ina 2015 Nature 
Physics paper, Ohno and collab- 
orators described the so-called 
‘creep slow motion of domain 
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2015 


Hideo Ohno uses a 
stereomicroscope 
in the study of 
spintronics at 
Tohoku University. 
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*Subjects may overlap. The sum of 
subject area WFCs may therefore 
exceed the institution’s overall WFC. 


walls in ferromagnets. As a practi- 
cal application, spintronics could 
transform household appliances 
and factory equipment into 
‘smart’ devices that collect and 
exchange data via the internet. 
It has been used to make high- 
performance nonvolatile memory 
storage, which doesn’t require 
energy to retain information. 
This helps countless processors 
and controllers stand by without 
power, an essential quality for 
the ‘Internet of things. Joining 
Tohoku gave Ohno the opportu- 
nity to lead a group, and access 
the best equipment to develop 
new materials and study spin- 
tronics. So what does Ohno have 
in store for 2016? “This is top 
secret,’ he says. 
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JAPAN | NATURE INDEX 
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Like every kid growing up 
in Hamamatsu City, Hiroshi 
Amano knew about hometown 
legend Kenjiro Takayanagi, a 
high-school teacher and inven- 
tor who created the first fully 
electronic TV screen in 1926. But 
Amano also dreamed of exceeding 
his achievement. The spark came 
when he studied engineering at 
Nagoya University and became 
interested in PCs. He wondered 
whether compact light-emitting 
diodes (LEDs) could replace the 
bulky cathode-ray tubes that were 
used in Takayanagi’s early TVs as 


Blue LEDs illuminate the Nagoya TV 
tower to celebrate their launch. 
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«Subjects may overlap. The sum of 
subject area WFCs may therefore 
exceed the institution’s overall WFC. 


well as PC displays. “I believed 
that if could develop a blue LED, 
I would change the world,” Amano 
says. “Of course, I was a fledgling 
student and I knew nothing about 
the difficulty of the subject.” His 
boldness paid off. In 1985, Amano 
produced the world’s first high- 
luminance blue LED, along with 
his supervisor, Isamu Akasaki, 
who focused on gallium nitride as 
a promising material. His research 
has also appeared in Nature 
Materials. The achievement won 
them the 2014 Nobel Prize in 
Physics, shared with Shuji Naka- 
mura. Today blue LEDs are used 
in everything from mobile phone 
backlights to energy-efficient 
traffic signals and lights for plant 
cultivation. 

Nagoya is a national univer- 
sity that traces its roots back toa 
medical school founded in 1871 
ina samurai court. These days the 
university's strengths are chemis- 
try and physical sciences. 

With a relatively light teaching 
load compared to faculty at private 
universities, Amano says he can 
concentrate on his research which 
is now focused on the po10tial of 
deep ultraviolet (DUV) LEDs for 
sterilization.“We could provide 
safe water for 2.4 billion people 
without safe sanitation and 663 
million people who cannot access 
safe drinking water,’ he says. m 
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SWIMMING AGAINST THE TIDE 


The rising stars on Japan’s research landscape include traditional big -hitters and newer 
institutions selected for initiatives to boost their global standing and promote autonomy. 


BY MARK ZASTROW 


counted among Japan's rising stars — those 

that have gained the most in the Nature 
Index since 2012 in terms of the absolute or 
relative increase in their weighted fractional 
count (WFC). 

Four of the index’s top 10 institutions are 
part of the country’s flagship national univer- 
sity system, including three of the top four. 
The rest are split between governmental or 
semi-governmental research institutions. and 
fully private universities. 

Many of the institutions have been selected 
for various government programmes intended 
to boost their global standing and international- 
ize their faculty. The top two, the Tokyo Institute 
of Technology (Tokyo Tech) and Okayama Uni- 
versity, are members of the Top Global Univer- 
sity Project, often referred to as the Super Global 
Universities programme, which started in 2014. 
And Doshisha University, a private institution 
in Kyoto, was also one of the beneficiaries of that 
programme’ forerunner, the Global 30. 


[once both public and private are 
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Tokyo Tech also hosts the Earth-Life 
Science Institute, which focuses on fundamen- 
tal research on the origins of life on Earth and 
is one of nine high-profile centres funded by 
the government's World Premier International 
Research Center (WPI) Initiative. 

These initiatives can be seen as part of the 
Japanese government policy that holds that 


“universities and institutes should have their 
own individuality,’ says Kumi Okuwada, a sen- 
ior fellow at the National Institute of Science 
and Technology Policy (NISTEP). However, 
the rise of these institutions takes place against 
a backdrop of concern that the nation’s research 
funding and output is stagnating. 
Nevertheless, government-backed institutions 


These 10 institutions have shown the largest increase in WFC in the Nature Index from 2012 to 2015. 
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Doshisha High Energy Japan Saitama National 
University Accelerator Atomic University Institute of 
Research Energy Information & 
Organization Agency Communications 


Technology 


are well-represented among those which have 
an increasing high-quality research output. 
One such organization is the Japan Agency 
for Marine-Earth Science and Technology 
(JAMSTEC). With headquarters in Yokosuka, 
south of Yokohama, it maintains offices on both 
sides of the Pacific and operates more than a 
dozen research ships and remotely operated 
vehicles. That includes the drilling vessel 
Chikyu. In 2012 it set a then-world record for 
the deepest hole in deep-sea drilling and, in the 
aftermath of the 2011 Tohoku earthquake and 
tsunami, investigated the fault that caused it. 

Another government organization rising 
in the rankings is the Japan Atomic Energy 
Agency, the nation’s nuclear R&D organiza- 
tion. Based in Tokai, north of Tokyo, it operates 
more than a dozen research institutes, employ- 
ing a staff of more than 4,000. In addition to 
its ongoing nuclear power research (including 
participation in the international ITER fusion 
reactor), it has devoted resources in recent years 
to the containment and clean-up of the Fukush- 
ima Daiichi reactor following the 2011 quake. 

The other government research institute 
in the top ten — the High Energy Accelera- 
tor Research Organization (KEK)—can count 
a Nobel Laureate among its recent faculty: 
Makoto Kobayashi received the physics prize 
in 2008 for his work on symmetry violation in 
particle physics. Only one other private uni- 
versity made the list—Chubu University near 
Nagoya, which has colleges in biotechnology, 
health sciences, and engineering. 


TOKYO INSTITUTE OF TECHNOLOGY 
2015 WFC: 120 2015 AC: 


Japan's fastest-growing research institute in 
the Nature Index is also the largest technical 
university: the Tokyo Institute of Technology. 
Its WEC increased from 112 in 2012 to 120 in 
2015. More than 40% of its revenue comes from 
public funds, and it has a research budget for 
commissioned projects in 2015-16 of 13.8 bil- 
lion yen (US$123 million). 

Tokyo Tech’s executive vice president of 
research, Makoto Ando, credits the univer- 
sity’s comprehensive range of research inter- 
ests as key to its rise in the Nature Index, citing 
research in materials science, cell biology, and 
fibre optic communication — as well as the 
Tsubame supercomputer, which the univer- 
sity owns and operates. Tsubame is the world’s 
second-most energy efficient supercomputer, 
one of the fastest owned by a university, and 
has powered compelling research ranging from 
measuring blood flow in drug simulations,to 
modelling seismic activity of the volatile Nan- 
kai Trough off the island of Honshu. 

The university operates 16 research insti- 
tutes, laboratories across its three campuses 
situated in the capital city and surrounds. Ando 
says reforms from April 2016 will include a 
focus on globalization: the university has nearly 
200 visiting international researchers, 10% of its 
students come from outside Japan, and it hopes 
to further boost those numbers. 
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JAMSTEC’S Chikyu, 
a giant of deep- 

sea drilling, was 

at the forefront of 
investigations into 
the fault that caused 
the 2011 Tohoku 
earthquake. 


PHYSICAL STRENGTH 


The line graph shows overall trends for both WFC and 
article count (AC) since 2012. The area plot highlights 
which subject has contributed most to the increase in WFC. 
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OKAYAMA UNIVERSITY 
2015 WFC: 2015 AC: 15 


Okayama University’s rise in the Nature Index 
is the second-largest in Japan, making it also the 
overall second-most prolific institute on the list. 
Its vice president of research, Shinichi Yama- 
moto, credits this to a focused effort to selec- 
tively fund distinctive research projects, helped 
by funding from the nation’s science ministry, 
which allowed it to create an organization for 
interdisciplinary studies. 

One of Okayamais strengths is in physics, from 
high-energy physics to its geophysical research 
institute. The latter’s expertise is also being 
applied to the study of other worlds. For exam- 
ple, a recent study in The Astrophysical Journal 
coauthored by Okayama’s George Hashimoto 
proposed a new target for astronomers hunting 
for exoplanets: planets with an entirely molten 
surface, covered with magma oceans. Scien- 
tists expect that most rocky planets, including 
Earth-like ones, go through this phase early in 
their evolution. The team’s models showed that 
they cool significantly after a million years or 
less, rendering their heat signatures practically 
invisible. However, the steam rising from the 
still-warm magma would be highly reflective, 
making them promising targets for the upcom- 
ing generation of telescopes. 

Okayama’ other major research centre focuses 
on plant science and photosynthesis. Yamamoto 
says the university has plans to increase its efforts 
in medical research, cybersecurity and big data. 
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Top: The Tsubame supercomputer at Tokyo Tech has powered research ranging from blood 
flow in drug simulations to modelling seismic activity. 

Middle: An impression of the exoplanet Kepler-20e. These molten-covered planets are the 
subject of work by geophysicists at Okayama University. 

Bottom: Sado, off the coast of Honshu is the site of Niigata University’s marine biology station. 
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JAPAN AGENCY FOR MARINE-EARTH 
SCIENCE AND TECHNOLOGY 


2015 WFC: 22 2015 AC: 84 


Below the abyss, there is the hadal, as in Hades 
of the Greek underworld. This is the realm 
of the deepest sea trenches. “It is the least 
explored biosphere on Earth, says environ- 
mental microbiologist, Takuro Nunoura, of 
the Japan Agency for Marine-Earth Science 
and Technology (JAMSTEC). 

In the ocean's deep abyssal zone, food is 
scarce, and microbes eke out what energy they 
can mostly from rocks. The hadal is even less 
surveyed, and previous work had been con- 
ducted like classical species hunting. But in a 
2015 study published in the US Proceedings of 
the National Academy of Sciences, Nunoura 
and colleagues brought to bear the techniques 
of modern environmental microbiology. 

To his surprise, they found a thriving eco- 
system of microbes living off organic mat- 
ter at the bottom of the Mariana Trench. He 
suspects that organic matter collected on the 
trench’s slopes gets churned into the water dur- 
ing underwater earthquakes, nourishing this 
unique ecosystem. 

JAMSTEC, which has been the nation’s pre- 
miere oceanographic institute since its found- 
ing in 1971, rose in the index by 47% between 
2012 and 2015. Nunoura says this coincides 
with the retirement of the institute's first gen- 
eration of scientists, and the rise of younger 
scientists to leadership positions. 
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NIIGATA UNIVERSITY 
2015 WFC: 13 2015 AC: 53 


The port of Niigata is the largest city on the 
Sea of Japan — the more sparsely populated of 
the nation’s coasts—and Niigata University is 
one of the coast’s largest national universities 
with more than 10,000 undergraduates and 
2,200 graduate students. 

Not surprisingly, much of its research 
reflects regional interests or challenges. It 
operates a marine biological station on Sado 
Island, 50-km offshore. 

One of its major research centres, the 
Research Institute for Natural Hazards and 
Disaster Recovery, plays a key role in assessing 
earthquake and tsunami risk and preparations 
along the Sea of Japan coast and in the moun- 
tains just inland. It studied and responded to 
big quakes in the region in 2004 and 2006, 
but its work extends to many mountain- 
ous regions at high risk of natural disasters, 
including in Nepal in the aftermath of its 2015 
quake. 

But Niigata has global ambitions as well, 
including in its other major centre, the 
Brain Research Institute. Founded in 1967, it 
remains the only neuroscience research insti- 
tute at a Japanese national university. 

Its agriculture department also maintains 
a collaboration with Thailand to study and 
develop specialized varieties of rice, includ- 
ing low-protein rice intended to benefit those 
with kidney disease. 
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CHUBU UNIVERSITY 
2015 WFC: 7 2015 AC: 19 


Located in Kasugai, a suburb northeast of 
Nagoya, Chubu University was founded in 
1938 as a technical and engineering univer- 
sity. In 2001 it opened a college of biotech- 
nology and five years later added another for 
life and health sciences in 2006. Today it has 
more than 10,000 undergraduate students and 
250 graduate students. The university's WFC 
increased from a low base, one, in 2012 to 7.4 
by 2015. 

In recent years, the university has focused 
on recruiting top researchers including 
chemist Hisashi Yamamoto, who is known for 
his work in synthesizing organic compounds 
and moved from the University of Chicago to 
head Chubu’s Molecular Catalyst Research 
Center. 

One of the biggest projects on campus is 
Satoru Yamaguchi’s work on superconduct- 
ing power lines, which could deliver elec- 
tricity through underground pipelines much 
more efficiently than overhead power lines. 
He is currently collaborating with two Japa- 
nese power companies and an internet cloud 
provider on a test site on the northern island of 
Hokkaido, a haven for wind and solar power. 
In 2015, they powered a data centre from solar 
power using a 500-m superconducting cable, 
one of the world’s longest tests yet. By 2018, 
they hope to test a 2-km line that can connect 
to the commercial power grid. m 
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The Subaru telescope in Hawaii, managed by NINS, was integral in the discovery of five new planets and is a fine example of the benefits of international collaboration. 


TOP TEAMS TO BE RECKONED WITH 


Japan’s institututions are increasingly joining huge collaborative research efforts, 
realizing that, on the big questions facing science, a problem shared is a problem halved. 


BY SMRITI MALLAPATY, SIMON PLEASANTS 
AND NICKY PHILLIPS 


" “ol days, large-scale international 
collaborations are a key engine of 
cutting-edge science, and the past 12 

months have provided some prime examples. 
While a single paper in Physical Review Letters 
credited more than 5,000 researchers, breaking 
the record for the most authors ona single arti- 
cle, the 2016 Breakthrough Prize in Fundamen- 
tal Physics was shared between 1,377 scientists 
investigating neutrino oscillations. 

The Nature Index is well poised to evaluate 
this important phenomenon because it counts 
the number of authors from different institu- 
tions who contribute to every paper included 
in the index. This is used to assign a collabo- 
ration score to institutions and countries. In 
Japan, the data confirms the link between qual- 
ity science and collaborations. Four of the top 
five collaborating institutions in 2015 are also 
among the most prestigious — the University 


of Tokyo, RIKEN and the universities of Kyoto 
and Osaka. Rounding out the top five is the 
National Institutes of Natural Sciences (NINS), 
established by the government in 2004 to man- 
age expensive public facilities. In 2014, almost 
9,000 researchers from more than 50 countries 
used the facilities at NINS to conduct research. 
“This model of designating special institutes to 
care for large-scale and cutting-edge facilities 
shared by researchers is unique to Japan,’ says 
Amane Koizumi, a professor at NINS. Similar 
institutes are set up for large information sys- 
tems and high-energy particle accelerators. 
The prominence of NINS reflects how 
necessity is driving Japanese research institu- 
tions to seek collaboration. The 2016 budget 
for science and engineering is set at 3.45 tril- 
lion yen, down 6.4% from its peak in 2012. To 
help institutions cope, the government has 
introduced policies and programmemes to fos- 
ter collaboration and resource sharing among 
research institutes and private companies. 
Yasuo Miake, a vice president and executive 
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director for research at the University of 
Tsukuba, says another factor driving the col- 
laboration effort in Japan institutions is govern- 
ment pressure to contribute to social innovation. 
There has been a mindset among some Japanese 
professors that universities should conduct sci- 
ence for science’s sake and remain independent 
of government and industry, says Miake. “That 
mentality is changing now.” 

A government strategy released in 2013 
identified collaboration between industry, aca- 
demia and government as one of six essential 
principles for science policy, setting targets to 
double, by 2030, the number of joint projects 
larger than 10 million yen and projects span- 
ning more than three years. Among the insti- 
tutions whose partnership with industry lead 
to high-impact research, the National Institute 
of Advanced Industrial Science and Technol- 
ogy (AIST) received the index’s second-highest 
collaboration score for its partnership with 
industry in Japan in 2015, following the Uni- 
versity of Tokyo. 
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COLLABORATION CHAMPIONS 


Japan's institutions are not just reaching out to domestic partners, but also internationally. The country 
has a wide collaboration network of academic and corporate connections across the world. 


Top 10 overall collaborators 


The University of Tokyo 

Kyoto University 

RIKEN 

National Institutes of Natural Sciences 

Osaka University 

Nagoya University 

Tohoku University 

Tokyo Institute of Technology 

High Energy Accelerator Research Organization 


University of Tsukuba 


RIKEN 
2015 COLLABORATION SCORE: 801 


RIKEN is one of Japan's leading research insti- 
tutions with roots in basic research, and wants 
to broaden its connection with industry. It is 
third on the list of Japanese institutions with 
the highest collaboration scores in 2015. 

“RIKEN is a pure research organization, and 
one of our strengths is managing large projects 
and large facilities. These endeavours require 
collaboration,’ says a RIKEN executive direc- 
tor, Shigeo Koyasu. 

RIKEN’s focus on basic science has led to 
strong partnerships with other domestic and 
international universities with strengths in this 
type of research. In the index, RIKEN’s top two 
domestic collaborators are the basic research 
powerhouses, University of Tokyo and Kyoto 

University, with 


“One of our which it works 
strengths is closely on stem-cell 
managing large science. 
projects and _ RIKEN’s top 
large facilities. international col- 
These endeavours !4°rators in the 
require index are MIT in 
collaboration.” the United States 
: and the Max Planck 
Society in Germany, 


but the institution also has major partnerships 
with universities in China, Korea, India 
and Malaysia. 

Despite having roots in fundamental 
research, RIKEN is moving towards solidify- 
ing partnerships with industry, says Akihiro 
Fujita, the head of RIKEN’s Cluster for Indus- 
try Partnerships. “People in industry have not 
been sure how to approach RIKEN precisely 
because our work is in basic research, and 
this uncertainty may have tended to hinder 
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Scores represent 
institution's 


partnerships with 
industry only. 


Top 5 collaborators with companies 


Collaboration scores are 
based on all bilateral 
partnerships between 
Japanese institutions 
and their domestic 

and international 
counterparts in 2015. 
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The contribution of the top 10 
collaborators to Japan's total 
collaboration score. 


RIKEN's top 5 international collaboration hotspots; and RIKEN's top 5 collaborating institutions. 
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progress in establishing joint research projects 
with industry.” The Cluster for Industry Part- 
nerships aims to bridge that divide by encour- 
aging partnerships with the private sector. 
One member of this cluster, the RIKEN 
Innovation Center, supports researchers to 
work with private companies to transfer their 
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The aim of 
personnel at the 
RIKEN Innovation 
Center (RinC), 

is to support the 
transfer of RIKEN’s 
many scientific 
breakthroughs into 
commercial reality 
through partnerships 
with private 
companies. 


science into commercial products and services 
that solve a specific technological challenge. In 
addition, RIKEN’s five collaboration centres 
with Olympus, Toyota, Rigaku Corporation, 
Takeda Pharmaceutical and JEOL are expected 
to lead to new products from basic research 
findings. 


RIKEN 


UNIVERSITY OF TSUKUBA 


NATIONAL INSTITUTE FOR FUSION SCIENCE 


UNIVERSITY OF TSUKUBA 
2015 COLLABORATION SCORE: 366 


An hour by train north of Tokyo, the Univer- 
sity of Tsukuba has taken a structural approach 
to fostering collaboration. Among the newer 
national universities, Tsukuba has broken away 
from some traditional university practices that 
would have discouraged collaboration. In the 
past, academic faculties at some universities 
were organized into large hierarchical research 
units around a professor. At Tsukuba, groups 
are much smaller and there are no restrictions 
on younger assistant or associate professors 
becoming group leaders, or principal investiga- 
tors. “Because each research group is small, PIs 
have to look for outside collaborators to carry 


oo a 5 ca) 


Synthetic organic chemists at Tsukuba University are among the small research groups at the institution. 


out their research,” explains Yasuo Miake, a vice 
president and executive director for research at 
the University of Tsukuba. “Big and traditional 
universities have tremendous barriers between 
departments. Such barriers are very small at the 
University of Tsukuba, which allows research- 
ers to talk more easily across departments,’ he 
says. While the university has achieved a short- 
term goal of doubling its income from industry 
through expanded partnerships with compa- 
nies such as Toyota, Miake says there is more to 
achieve. Tsukuba is working to strengthen ties 
with the almost 100 industrial, academic and 
governmental organizations clustered at nearby 
Tsukuba Science City, some of which, like the 
National Institute of Advanced Industrial Science 
Technology (AIST), are already among Tsukuba 
top domestic collaborators in the index. 
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UNIVERSITY OF TSUKUBA’S PARTNERSHIPS 


University of Tsukuba's top 5 international 
collaboration hotspots; and top 5 collaborating 


institutions. 
UNITED STATES 
26% 


GERMANY 
1% 
ITALY FRANCE 
6% UNITED KINGDOM 8% | | 10% 


OWN CONTRIBUTION 
(=i e 
PARTNER CONTRIBUTIO 


as) 


REST OF 
te WORLD 
i TSUKUBA'S TOP 5 
COLLABORATIONS 


National 
Institute for 
Material Science 


The University 
of Tokyo 


National Institute of 
Advanced Industrial 
Science & Technology 


Kyoto 
University 


French National 
Centre for Scientific 
Research (CRNS) 


e) 5 10 15 
Collaboration score 


NATIONAL INSTITUTES OF NATURAL SCIENCES 
2015 COLLABORATION SCORE: 701 


It took a team of 16 researchers from seven insti- 
tutions, using three telescopes on two islands to 
discover five new Jupiter-like planets orbiting 
three large stars. The findings, published in The 
Astrophysical Journal, resulted from a 16-year 
collaboration between the National Astronomi- 
cal Observatory of Japan, part of the National 
Institutes of Natural Sciences (NINS), and col- 
leagues in Japan and the United States. Lead 


A Anni waa he 


NIFS’ Large Helical Device for fusion plasma research 


author Hiroki Harakawa from NAOJ says in 
radial-velocity planet searches, collaborating 
with researchers using other facilities adds to 
the number of observations required to confirm 
planetary systems among hundreds of targets. 
In this case, the observations were made using a 
telescope in Japan and two telescopes in Hawaii, 
one which belongs to the United States and the 
other to Japan, and managed by NINS. Among 
the many big research facilities it operates, the 
optical-infrared Subaru telescope on top ofa 
volcano in Hawaii is the jewel in its crown. 
NINS’ international research alliances may 


NINS’S PARTNERSHIPS 


continue to strengthen in the coming decade if 
the proposed construction ofa new 30-metre tel- 
escope on the same peak in Hawaii goes ahead. 
A joint project by institutes in India, the United 
States, China, Canada and Japan, it would be the 
world’s largest optical-infrared telescope . 

In 2015, NINS researchers teamed up with 
scientists at 490 international and 105 domes- 
tic institutions to publish papers featured in the 
Nature Index, a decline from 2014. Their main 
domestic collaborator was the University of 
Tokyo, while the Max Planck Society was their 
most successful international partnership. 


NINS's top 5 international collaboration hotspots; and top 5 collaborating institutions. 
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HIGH ENERGY ACCELERATOR 
RESEARCH ORGANIZATION (KEK) 


2015 COLLABORATION SCORE: 439 


Koichiro Nishikawa (left) collects his prize. 


When KEK professor, Koichiro Nishikawa, 
accepted this year’s Breakthrough Prize in Fun- 
damental Physics, he was doing so on behalf of 
one of five teams awarded for their neutrino 
experiments that spanned several decades. 
Nishikawa is a member of the T2K particle 
physics experiment, a successor to the K2K pro- 
gramme, a long-baseline neutrino oscillation 
experiment involving collaborations between 
several countries including Japan, Germany, 
Italy, South Korea, Spain, the United States and 
the United Kingdom. These countries represent 
seven of the top ten nations with whom Japan 
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KEK'S top 5 international collaboration hotspots; and top 5 collaborating institutions. 
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partnered to produce high-quality science 
included in the Nature Index in 2015. 

Another huge and successful collabora- 
tion involving KEK scientists is the ATLAS 
experiment, which observed a particle that 
behaved exactly like the illusive Higgs boson 
in 2012. 

About 200 Japanese scientists, including 14 
from KEK, were involved in major elements 
of the construction of the ALTAS detector, 
including the silicon strip detector to detect 
charged particles, the superconducting sole- 
noid magnet and the electronics for the muon 
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trigger chamber. “To build the large accelerator 
and the large detector, international collabora- 
tion is essential in terms of both budget and 
human power,’ says Kazunori Hanagaki, pro- 
fessor of the Institute of Particle and Nuclear 
Studies (IPNS) at KEK. 

In the Nature Index, KEK’s focus on high- 
energy physics reflects the main institutions with 
which its researchers collaborate. KEK scientists 
co-authored the most high-impact articles with 
counterparts at the INFN National Laboratory 
in Italy, publishing 141 papers in the 68 high- 
impact journals featured in the Nature Index. 
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It’s notoriously difficult to make a material that 
reflects light and is also thermally insulating. 
In August last year a collaboration between 
researchers from the National Institute of 
Advanced Industrial Science and Technology 
(AIST) and Kawaken Fine Chemicals Co. Ltd 
succeeded where others had failed. 

Taking inspiration from the layered arrange- 
ment of fish scales, the research team devised 
a simple method for producing an alumina 
film that strongly reflects light and has a high 
thermal insulation comparable to that of 
wool. The method was reported in Advanced 
Materials. 

The study demonstrates the importance of 
fostering collaborations across the academia— 
industry divide says Kenji Hata, the director of 
the CNT-Application Research Center at AIST. 

“Ten or twenty years ago, Japanese com- 
panies were very strong and would hire their 
own researchers, but they can't afford to do that 
anymore.” 

AIST was established in 2001 with a strong 
emphasis on applied science and collabora- 
tions with industry. The institution under- 
went a major reorganization in 2015 to better 
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align its projects to meet the needs of indus- 
try, including creating a new organization in 
charge of technology marketing. Today, in 
addition to its own 2,300 researchers, about 
1,800 scientists and engineers from private 
industry carry out research at AIST each year. 


AIST campus in Tsukuba 


In 2015, AIST’s most fruitful industry part- 
nership in the Nature Index was microscope 
manufacturer JEOL Ltd, followed by telephone 
communications company, NTT Group. = 
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A GUIDE TO THE NATURE INDEX 


A description of the terminology and methodology used in this supplement, 
and a guide to the functionality available free online at natureindex.com. 


he Nature Index is a database of 

| author affiliations and institutional 

relationships. The index tracks 

contributions to articles published in a group 

of highly selective science journals, chosen by 
an independent group of active researchers. 

The Nature Index provides absolute counts 
of publication productivity at the institutional 
and national level and, as such, is one indicator 
of global high-quality research output. 

Data in the Nature Index are updated 
monthly, with the most recent 12 months of 
data made available under a Creative Com- 
mons licence at natureindex.com. 

The database is compiled by Nature 
Publishing Group (NPG) in collaboration 
with Digital Science. 

The list of journals tracked by the Nature 
Index is under review, and from 2016 will be 
extended to include the clinical sciences. 


NATURE INDEX METRICS 

There are four measures provided by the 
Nature Index to track affiliation data. The sim- 
plest is the article count (AC). A country or 
institution is given an AC of 1 for each article 
that has at least one author from that country 
or institution. This is the case whether an arti- 
cle has one or a hundred authors, and it means 
that the same article can contribute to the AC 
of multiple countries or institutions. 

To get a sense of a country or institution's 
contribution to an article, and to remove the 
possibility of counting articles more than 
once, the Nature Index uses the fractional 
count (FC), which takes into account the 
relative contribution of each author to an 
article. The total FC available per paper is 1, 
which is shared between all authors under the 
assumption that each contributed equally. For 
instance, a paper with 10 authors means that 
each author receives an FC of 0.1. For authors 
who have joint affiliations, the individual FC 
is then split equally between each affiliation. 

The third measure used is the weighted 
fractional count (WFC), which applies a 
weighting to the FC to adjust for the over- 
representation of papers in astronomy and 
astrophysics. The four journals in these 
disciplines publish about 50% of all papers 
in international journals in this field — 
approximately five times the equivalent 
percentage for other fields. Therefore, 
although the data for astronomy and 
astrophysics are compiled in the same way 
as for all other disciplines, articles from these 
journals are assigned one-fifth the weight of 
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NATUREINDEX.COM 


A global indicator of high-quality research 


natureindex.com 
users can search for 
specific institutions 
or countries and 
generate their own 
reports, ordered by 
article count (AC), 
fractional count (FC) 
or weighted fractional 
count (WFC). 

Each query will 
return a profile page 
that lists the country 
or institution’s recent 
research outputs, 
from which it is 
possible to drill down 
for more information. 
For example, articles 
can be displayed by 
journal, and then by 
article title. As in the 
supplement, research 
outputs are organized 


Home 


Home / 


Institution name 
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Region: Global 
Subject/journal group: All 


Below, the same research outputs are grouped by 
subject. Click on the subject to drill-down into a list 
of articles organized by journal, and then by title. 


subject area. 


Institution outputs / Institution na 


The table to the right includes counts of all 
research outputs for Institution name published 
between 1 January 2014 - 31 December 2014 
which are tracked by the Nature Index. 


Note: Articles may be assigned to more than one 
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1 January 2014 - 31 December 2014 


WFC 


1221 598.04 558.30 


Outputs by subject 


by subject area. The Subject AC FC WFC 
profile page also I Chemistry 276 «179.1 179.14 
lists the institution || Earth & Environmental Sciences 95 42.73 42.73 


or country’s top 
collaborators, as well 
as its relationship 
with other research 
organizations. 


|| Life Sciences 


| | Physical Sciences 


other articles (i-e., the FC is multiplied by 0.2 
to derive the WFC). 

The total FC or WFC for an institution is 
calculated by summing the FC or WEC for 
individual authors. The fourth measure is the 
collaboration score (see The Supplement). 

The process is similar for countries, 
although complicated by the fact that some 
institutions have overseas labs that will be 
counted towards their host country totals. 
What's more, there is great variability in the 
way authors present their affiliations. Every 
effort is made to count affiliations consistently, 
with a background of reasonable assumptions. 

For more information on how the affiliation 
information is processed and counted, please 
see the FAQ section at natureindex.com. 
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439 231.50 231.50 


652 284.48 244.74 


THE SUPPLEMENT 

Nature Index 2016 Japan is based on data from 
the Nature Index, covering articles published 
during four consecutive years from 1 January 
2012 to 31 December 2015. 

Most analyses within the supplement use 
WFC as the primary metric, as it provides a 
more even basis for comparison across multiple 
disciplines, and in determining the relative 
contribution of each city or institution. Some 
sections and graphics also refer to collaboration 
score. This is a relatively new metric that is 
derived by adding the FC for all the bilateral 
relationships for that institution or country. If 
institution A has relationships with two others, 
BandC, then the collaboration score is the sum 
of FC for A+ Band A+C.m 
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Inspiring a new generation 
of innovators 


With the opening of its new Graduate 
School of Science, Technology and 
Innovation in early 2016, Kobe 
University will bring together ideas 
from across the sciences and commerce 
to forge new areas of innovation. 


obe city, located just 30 kilometres 

west of Osaka, is a major historical 
centre of commerce and one of the 

first Japanese port cities to open up to the 
rest of the world after the end of the coun- 
ry’s policy of isolation in the mid-1800s. 
This long commercial and ideologica 
interaction with the West has given Kobe 
a reputation of being an internationa 
and cosmopolitan city — an important 
player in the modernization of Japan and 
a place where great things happen. And it 
was in this melting pot in 1902 that Kobe 
University has its origins. 
“Kobe University started as a highe 
commercial school,” says President Hiroshi 
Takeda. “This commercial tradition formed 
the groundwork for some of Japan’s top 
achievements in education and research, 
particularly in business administration and 
economics, and we have produced many 
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of the business leaders who helped lay the 
foundations of modern Japan.” 
At the same time, Kobe University has 
built an international reputation for educa- 
ion and research in the natural sciences, 
ife sciences and medicine, from biopro- 
duction to advanced membrane technol- 
ogy, infectious disease research, synthetic 
biology and planetology. One of Japan’s 
greatest contributions to the life sciences — 
he discovery and development of induced 
pluripotent stem (iPS) cells — was made by 
Kobe University’s Shinya Yamanaka, who 
received the 2012 Nobel Prize in Physiology 
or Medicine for the achievement. 
“While continuing to work 
more in-depth research in each 


owards 
field, we 
also emphasize combinations of and col- 
laborations between different academic 
areas by engaging in various advanced 
and interdisciplinary research projects,” 
says Takeda. “In 2007, we created a 
unique interdisciplinary organization, the 
Organization of Advanced Science and 
Technology, to streamline graduate school 
education and promote advanced re- 
search and collaboration across five of our 
science graduate schools. The Graduate 


President Hiroshi Takeda 


School of Science, Technology and 
Innovation will further support advanced 
interdisciplinary research in the natural 
sciences as well as equip our students 
with the entrepreneurial skills needed to 
commercialize academic research results.” 


Bridging the gap 

The new Graduate School of Science, 
Technology and Innovation is unique in 
Japan in embedding education and ca- 
pacity-building in innovation, commercial- 
ization, entrepreneurship and enterprise 
into a graduate science programme. With 
this initiative, Kobe University is aiming to 
nurture a new generation of science pro- 
fessionals with a commercial mindset and 
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International collaboration 


Both the number (orange bars) and proportion (blue line) of internation- 
ally co-authored papers are increasing at Kobe University. Researchers 
at the university are actively engaging in international collaborations 
across a wide range of fields from astrophysics to economics. 
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International Affairs Planning Division 
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Anational innovation hub 
The holistic integration of research, deve 
opment and commerce at Kobe University 
does not end at the campus gates. The uni- 
versity’s collaboration and joint research 
activity extends well into Kobe city and 
beyond — an area awash with national re- 
search institutes and world-class research 
facilities, including the Port Island super 
science cluster, the SPring-8 high-energy 
particle accelerator and SACLA X-ray free 
electron laser, and the K computer — one 
of the fastest supercomputers in the world. 
“Kobe University successfully blends 
radition with innovation, and humanities 
with the social and natural sciences,” says 
Takeda. “We provide special education 
programmes, including some offered en- 
irely in English, with the aim of producing 
globally minded students. And we pursue 
innovation through — interdisciplinary 
research. In this way, we will continue to 
strive to meet the challenge of creating 
new values in order to solve current issues 
and support the society of the future.” 
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To wander with 


purpose 


A tradition of unorthodox thinking has 
defined Kyoto University over its 120- 
year history. The freedom to explore un- 
conventional ideas allows cutting-edge 
science to flourish at the institution. 


resident Juichi Yamagiwa’s ambition 

is to keep Kyoto University a safe 
haven for free thinking: free from 
societal pressures, free from prejudice and 
free from existing paradigms. 
n present-day Japan, people tend 

to be treated harshly when they think 
outside the box,” Yamagiwa observes. 
“Universities should protect students 
and researchers from that. Free thinking 
arising from the liberal atmosphere here 
helps pioneer the future.” 
any come to Kyoto for the opportunit 
o conceive and explore radical ideas. The 


«| 


include physicist Hideki Yukawa (second 
photo from bottom), the first Japanese 
obel laureate, and Kitaro Nishida, who 
founded the Kyoto school of philosophy. 
Yamagiwa is also a pioneer, spending years 
e African jungle living with gorillas and 
dying human evolution. 

Ultimately, Yamagiwa plans to make 
Kyoto an incubator for ‘frontier science’ — 
progressive research based on interdisci- 
plinary communication. “Frontier science 
should transcend national borders and 
also the differences between the arts and 
he sciences,” he says, reaffirming his inten- 
ion to spread Kyoto’s distinctive approach 
o the rest of the world. “Our researchers 
don’t just follow the cutting edge. They are 
he pioneers that plough the field for oth- 
ers to follow,” he emphasizes. 


Animals and technology 
And plowing through the field of great 


ape behavior is Fumihiro Kano, who 


employs the latest technologies, includ- 
ing eye tracking and thermal sensing, to 
his research. In a recent study, Kano used 
eye-tracking technology to examine how 
well apes recall one-off events stored in 
heir long-term memories. Kano’s team re- 
corded where the apes looked while view- 
ing video clips starring a colleague dressed 
as a ‘bad’ gorilla. The results demonstrated 
hat apes can remember events they had 
seen or experienced only once. “Studying 
animal behavior is no longer just about 
direct observation and long-term training,” 
Kano explains. 
eanwhile, Kano’s colleague Masaki 
Tomonaga has been testing the visual per- 
ception of ponies using touch screens. In 


his study, ponies discriminate between di 
ferent shapes by nuzzling a touch screen. 
Comparison of these results with those o 
dies involving other mammals reveals 
hat ponies see the world as humans do, 
S 
C 


ays. “New technology is developing 
ly. lt opens up So many options.” 


Peering into germ-cell development 
Behavioral scientists are not the only ones 
finding new uses for the latest advances. 
Kotaro Sasaki is breaking new ground 
with induced pluripotent stem (iPS) cells 
whose discovery led to Kyoto University’s 
Shinya Yamanaka being awarded the 2012 
obel Prize in Physiology or Medicine. 

Pluripotent stem cells have been used 
O generate just about every cell type in 
he body except germ cells, which include 
sperm and ova. Unlike other stem-cell- 
based therapies, which affect only the 
patient, those that induce germ cells could 
also benefit the patient’s offspring. 


Germ cells 
of the ethical 
in experimen 


are difficult to 
and technical 
ation. “When 


study because 
issues involved 
read about his 


work, | knew | had to come back to Japan,” 
says Sasaki, referring to the work of lab 
leader Mitinori Saitou. Sasaki, who was a 
renowned pathologist in the United States, 
returned to join Saitou’s team in Kyoto. 
Sasaki spent a month at the Center for 
iPS Research and Application learning how 
to work with human iPS cells and about 
the developmental process of human 
germ cells. In addition to illustrating key 
transcription interactions and _ signaling 
events, Sasaki gained insight into how epi- 
genetic marks — traits that are inherited 
without changes to the DNA sequence — 
are ‘erased’ at the beginning of germ-cell 
development. 
The team’s model, still in its early stages, 
may form a foundation for continuing 
studies on germ cell lineage. “By further 
reconstituting human germ-cell develop- 


ment in vitro,” explains Saitou, “we may 
be able to elucidate the mechanisms 
hroughout the developmental process, 
from embryo to adult.” 


‘Seeing’ black holes with visible light 

n the field of astronomy, Mariko Kimura 
second photo from top on facing page), 
a master’s student, took full advantage of 
yoto University’s international network 
when she witnessed a major event unfold 


in front of her eyes. 


Our locations: 


; Ty 
Kyoto University 
maintains research 
and observation 
centres at locations 
throughout Japan 
(pictured), from 
Hokkaido to 
Kagoshima, as well 
as numerous offices 
and facilities 
overseas. 


Kimura called affiliated astronomers 
when the binary black-hole system V404 
Cygni generated an outburst, an event tha 
occurs only a few times a century. Within 
minutes, she had international collabora- 
ors supporting her with unprecedented 
amounts of observational data. “We now 
now that we can make observations 


based on optical rays — visible light, in 
other words — and that black holes can 
be observed without high-spec X-ray o 
gamma-ray telescopes,” Kimura says. 

Her study indicates that the main fac- 
or triggering repetitive activity around 
black holes is not the quantity of mass 
accretion, but rather the length of orbita 
periods. “The study would not have come 
0 fruition,” says Daisaku Nogami, leader o 
he research unit, “were it not for the read- 
ily available network Kyoto astronomers 
have established through years of effort 
with a view to advancing mutual support 
in observation.” 

“Stars can only be observed afte 
dark, and there are only so many hours 
each night, but by making observations 
from different locations around the 
globe we're able to gather data more 
comprehensively,” elaborates Nogami. 


Searching for where happiness 
happens 

In Yamagiwa’s haven for free thinking, 
other researchers delve into a question 
left unanswered for millennia — our own 
well-being. 

Throughout human history, eminent 
scholars such as Aristotle have contem- 
plated happiness, says Wataru Sato. He is 
termined to find answers with his own 
st and with the help of functional mag- 
etic resonance imaging (fMRI). “Emotions 
rise from our subconscious mind and 
irit”” says Sato. “People bond through 
motions, but we can’t access our subcon- 
ious minds directly. Being able to under- 
and that part of ourselves could help us 
ve problems associated with the mind 
nd spirit — including how to be happy.” 
At the Kokoro Research Center, schol- 
ars like Sato explore the mind and spirit, 
with some examining the topic from a 
philosophical perspective, and others, like 
Sato, integrating neurological insights in 
their studies. 
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President Juichi Yamagiwa 


Using fMRI, Sato has identified the part 
of the brain that ‘feels’ happiness and has 
clarified what factors constitute happi- 
ness. According to his study, happiness is 
a combination of happy emotions and sat- 
isfaction with life coming together in the 
precuneus, a region in the medial parietal 
lobe that becomes active when experienc- 
ing consciousness. 

For Yamagiwa, this is only the beginning. 
Even in the face of the uncertainty of this 
globalized era, Yamagiwa believes that 
Kyoto can lead the world with its creativity. 


And he is not merely talking about edgy 
science. “We wander in academic freedom 
with a purpose,” Yamagiwa says. “These 
innovations will lead to solutions for a har- 
monious, sustainable, global society.” 


Contact 


www.kyoto-u.ac.jp/en 
www.facebook.com/Kyoto.Univ 
www.twitter.com/KyotoU_News 
Tel: 075-753-2071 

E-mail: 
comms@mail2.adm.kyoto-u.ac.jp 
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UNIVERSITY 


r : Our ethos Is always to look ahead, by seeking to 
understand the challenges, issues and needs of 
coming generations. The art of solving tomorrow's 
problems today requires us to be relentlessly 
innovative, open to experiment, and committed to 
working at the leading edge of current scientific 
thought and discovery. 


Visionary research 
Kyushu University is conducting a number of cutting-edge research projects. Dedicated to the vision of a green future, 
we have developed an array of exciting, world-leading programs in the area of clean energy technology. Among these 
research programs are those devoted to carbon-neutral energy, wind power systems, and next-generation fuel cells. 


The heart of the environment 

Although Kyushu University is over 100 years old, our recently established Ito campus forms the new heart of our 

academic environment. Ito is the biggest single campus in Japan and features state-of-the-art facilities for research and iB 
education. Ito provides not only a thriving, self-contained community, but also easy access to the vibrant, cosmopolitan Nii Z 
city of Fukuoka, with the nearby sea, mountains and forests of the region offering ample opportunities to enjoy 

beautiful natural surroundings. 


: Se 
Connections 
Kyushu University is cultivating Ito as a hub connecting both local and global communities. The business sector is 
strongly represented, with companies maintaining their laboratories on-campus, facilitating a seamless transaction KYUS H U 
between academic research and commercial development and application. We also have extensive connections and UNIVERSITY 


research partnerships within the local community, such as on-site experimentation at nearby farming villages and 
participation in the ICT utilization project for social security purposes. www.kyushu-u.ac.jp 
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Japan’s research 
universities go global 


The Research University Network of — international cooperation of Japan’s re- forces to address common issues facing 
Japan opens anewwindowtotheworld. — search universities. universities. To aid communication in a 
n 2013, Sato and directors from 25 re- — global context, RUNJ has proposed the 

s a pioneer of ‘inflation theory, search universities and institutes across creation of a Japan portal for EurekAlert!, 
Katsuhiko Sato spent much of his = Japan agreed to establish the Research an online science news service provided 
career grappling with questions University Network of Japan (RUNJ), an by the American Association for the 
about the origins of the Universe. Now, — organization that aims to enhance inter- Advancement of Science. This highly 
as president of the National Institutes of | national collaboration and further rela- — visible communication channel would 


Natural Sciences (NINS), he is comple- ionships between Japanese researchers allow Japanese research universities to 
menting his cosmological concerns with — and their counterparts onthe globalstage. disseminate bilingual information to a 
more earthly matters: expanding the RUNJ was established as a part of the —_ wide global audience. RUNJ’s Task Force 
inistry of Education, Culture, Sports, on International Collaboration has also 
Gateway between Japan and the world Science and Technology’s Program for established Gateway, an email-based 
The Research University Network of Japan Promoting the Enhancement of Research — system that will expedite the circulation 
(RUNJ) seeks to promote international col- Universities. This government initiative of information concerning research col- 
laboration with Japan's research universities. supports efforts to link Japanese research —_laboration, offering a new approach to 
HESS 1 Pio leie OMY MSH go universities and form a network of sharing — problem-solving with an eye towards in- 
networking. : : ; 
and mutual support. It includes 19 univer- _ ternational research development. 
A sities, together with NINS and two other Together, these RUNJ initiatives are 
contact@runetwork.jp Rene ; : 
members of the Inter-University Research opening a new window to the world for 
> Access several strategically placed nstitute Corporation (IURIC). IURIC is a | Japanese researchers, allowing the forg- 
administrators who interface with : J roar h F ? ees A : 
eratice. unique Japanese organization that aims ing of richer and stronger connections 
: ee 0 create a joint forum for researchers and with their international peers. 
> Coordinate joint workshops and h Z : ft with th : 
seminars and encourage international otner university stair with tne ultimate 
collaboration. goal of enhancing the research capabili- 
EurekAlert! Japan Portal nowacts asa ies of academic institutions throughou 
portal between Japan’s research universi- he country. 
ties and the world “Speaking with a united voice, we can 
RUNJ has collaborated with the American be much more effective in connecting noe sAiideail belaacae 
Association for Advancement of Science to re- with our global peers,” Sato emphasizes. 
design their EurekAlert! and to launch a Japa- “Our goal is to act not as single entities, 
nese news portal to disseminate our bilingual ; 
press releases to the site’s global audience Baad) united CiOup Or depalisseri 
j search universities, communicating our Contact 


EurekAlert! Japan portal: activities and recommendations not only E-mail: contact@runetwork.jp 
www.eurekalert.org/language/ to Japan but also around the world.” Phone: +81-3-5425-1300 
japanese.php RUNJ has ambitions to engage in con- —— NINS: www.nins.jp/english/index.php 


crete activities, such as establishing task RUNJ: www.runetwork.jp/english/ 
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AIE 


A stu ent- 


approach to education 


OIST is forging a new path of graduate 
education — one that its students help 
to create 


he Okinawa Institute of Science and 

Technology Graduate University 

(OIST) recognizes that many great 
discoveries lie at the intersections of the 
major sciences, where researchers make 
novel connections between disparate 
disciplines. Accordingly, it has developed 
a graduate programme for students to be- 
come highly proficient in their core disci- 
pline while also learning to communicate 
scientifically with people in completely 
different disciplines, such as quantum 
physics and cell biology. 
“We want our students to achieve first- 
class research outcomes, to creatively 
address important scientific questions 


and to grow to their full potential as inde- 
pendent scientists playing leading roles in 
research,” says Jeff Wickens, dean of the 
OIST Graduate School. “We encourage an 
international perspective and growth that 
is unrestricted by the traditional boundar- 
ies between disciplines. 
Freedom is a vital condition of an 
environment that fosters creativity. At 
OIST, students are not admitted to spe- 
cific laboratories, but are given a year to 
decide in which research unit they will do 
heir thesis work. Students may start the 
programme with only a bachelor’s degree. 
they come with a master’s degree, they 
can get credit for their additional back- 
ground, but they still have a year to choose 
heir thesis lab. 

The PhD programme is individualized 
with a flexible curriculum that can be 
ailored for each student. Every student 
is treated as a unique individual and can 
select courses without the constraints of 


raditional departments. To help students 
make the most of the opportunities and 
develop a coherent programme of stud- 
ies, they are assigned an experienced 
faculty member as an academic mentor 
o guide their course of study and assist 
hem choose a thesis lab. Small classes, 
a low student-to-faculty ratio of just two 


o one and collaborative principles let the 
students shape how classes are taught. 
f OIST does not offer a recommended 
course, the student can work with a tutor 
in guided independent study or with visit- 
ing professors in special topics. 
Instead of having traditional depart- 
mental structures, OIST has concentra- 
ions in several fields, which, while distinct, 
permit cross-disciplinary interactions. 
OIST has faculty in chemistry; physics; 
mathematical and computational scienc- 


es; systems biology and bioinformatics; 
molecular, cellular and developmenta 
biology; neuroscience; and environmen- 
al and ecological sciences; and marine 
sciences. Students can take courses and 
conduct PhD projects in these fields and 
are encouraged to stretch beyond thei 
core discipline and take some courses and 
aboratory rotations that are well out of 
heir field. 

From the first week, students participate 
in research, even before choosing a thesis 
ab. In each of the three terms of the first 
year, they conduct a research project in a 
new lab. Usually two rotations fall within 
he chosen field (theoretical and experi- 
mental physics, for example) and one far 
outside it (like ecology). 

“Rotations are designed for the student 
o learn, as an insider, how to speak the 
anguage and apply the techniques used 
in a research unit. We challenge them 
go beyond their comfort zones, to lea 


Oo 


ij 


first 


bilit 


DOs 


hei 
ine 


he 
are 


ideas and collaborative projects. 
In the second year, students define 


esearch by doing research and to gain 


-hand experience working alongside 


eading scientists and faculty members,” 
says Wickens. “This opens up future possi- 


ies for meaningful interactions with re- 


searchers in different fields, leading to new 


sibilities from the cross-fertilization of 


” 


r PhD thesis topics and labs, prepar- 
a proposal that will form the basis 


of their end-of-year qualifying examina- 
ion. 


Prominent international experts in 
field of the student’s research topic 
brought to OIST to conduct an oral 


examination, setting a high international 


standard for the OIST PhD degree. 


phi 
offe 


ment. This encourages interaction and 
collaboration within and between disci- 


hroughout the programme, the OIST 
osophy of research is promoted by 
ring equal access to research equip- 


plin 


T 
qd, 


bo 
ati 
pac 


w 
O 


new discoveries. 


e 
studies and research w 
a 
n 


other leading research universities. Most 
students initially live in the campus village, 
further strengthening the OIST community 
th plenty of opportunities for socializing. 
IST is truly international. Education 


es as a way to find novel solutions and 


he OIST PhD programme is fully fund- 
allowing students to progress in thei 
thout worrying 
ut finances. They all receive an inter- 
onally competitive financial support 
age, comparable to those offered by 


and 


research are conducted in English, 


and 


the academic year starts in 


September. Students are encouraged to 


searc 


inlea 


is ast 


porte 


ies worldwide. 


interests and nati 


avel internationally to keep abreast of 
new developments, disseminate their re- 


h findings and tap into the extensive 


networks of OIST faculty members. This 
will develop future career opportunities 


ding research institutes and universi- 


The diversity of the student population 


ounding, both in terms of scientific 
onal origins. The student 


community is vibrant with members sup- 


d by active programmes to promote 


well-being and to help them support each 
other. To cover any gaps in academic 
background, se 


f-help programmes have 


sprung up both spontaneously and by de- 
sign, as ‘skill pills’ organized by the gradu- 


ate s 


needs as they arise. These, along with 
an ongoing programme of internationa 
workshops and courses, enable students 


to gal 
Sin 


outside academia i 


deve 
hrou 


mizin 


is given in presenting science, research 
conduct and ethics, with a view to maxi- 


chool to address additional study 


n teaching experience. 

ce competition for jobs within and 
s fierce, professiona 

opment opportunities are provided 

ghout the programme. Training 


g future career opportunities. Caree 


deve 


invite 


a programme of seminars by student- 


opment is seriously promoted, and 


d speakers provides perspectives on 


management, leadership and entrepre- 
neurship to help students develop into 


Fa 


> 


> 


leaders 


research skills but also a global mindset. A 
constant stream of visiting speakers covers 


diverse 
of deve 
impact 
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Stutaets: 


Ratio of students to faculty 
members 2:1 


Average class size: 4.5 


Percentage of OIST students 
who are non-Japanese: 80% 


Percentage of OIST professors 
from overseas: 70% 


Percentage of OIST professors 
awarded PhDs abroad: 73% 


Proportion of students in 
physical sciences/life sciences/ 
other: 45/45/10 


Percentage of female students: 
33% 


who not only have outstanding 


topics ranging from the economics 
opment to how to publish in high- 
journals. 


Commitment to students is one of the 


core va 
our Ph 


ues of the university. “What makes 
D programme so exceptional is 


he stu 


individu 


dent-centred approach,’ explains 


Wickens. “We recognize that students are 


als and design their programmes of 


study wi 


th them, according to their unique 


needs and scientific aims. We provide the 


resources, guidance and support they need 
for each step they take towards achieving 
heir goals in research and scholarship. 


”» 


OIST 


Contact 


https://admissions.oist.jp/ 
https://www.facebook.com/oistedu/ 


e-mail: 


admissions@oist.jp 
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Japan’s 
Most Innovative 
University 


18 Most Innovative in the World 
By Reuters Top 100 Innovative Universities 2015 


Eleven Osaka University Researchers 
Selected to Thomson Reuters’ 

"The World's Most Influential 
Scientific Minds 2015" 


“Highly Cited Researchers” are researchers selected by 
Thomson Reuters with theses that have a large number of 
citations from all over the world. In 2015, some 3,000 
researchers from 21 fields were selected as performing some 
of the most influential researcher in the world. 


= Osaka University in the Global Top 40 of Nature Index (see inside) 
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Osaka University’s 
‘open’ revolution 


In the lead up to its 90th anniversary, 
Osaka University has embraced an 
ambitious vision to increase openness 
and innovation in its research, educa- 
tion, community and governance in 
order to stimulate the kind of cross- 
disciplinary, cross-institutional and 
cross-border research needed to meet 
global challenges. 


ne feature that has characterized 
research and technological ad- 

vance since the turn of the century 

has been the steady rise and remarkable 
fruitfulness of interdisciplinary and inter- 
institutional collaboration. In its bold vi 
sion for the next six years, Osaka University 
has its sights set on taking this type of col- 
aborative openness to a new level, open 
ing the laboratory doors to the broader 
research and industrial community and 
also to society at large. 
“Our vision emphasizes openness,” says 
he university’s president, Shojiro Nishio. 
“We want to break down the barriers be- 
ween departments on campus and those 
separating the university from the outside 
world. We plan to open our academic 
findings and research outcomes to society 
hrough our five ‘pillars’ of open educa- 
ion, open research, open innovation, 
open community and open governance. 


By interweaving knowledge from differ- 
ent disciplines and working together with 
diverse participants in society, we aim to 
evolve into a place where new knowledge 
can be co-created together.” 

One key initiative of the university’s vi- 
sion for open research is the development 
of a big data platform designed to central- 
ize and share all of the accumulated data 
obtained through experiments and simu- 
lations in every academic discipline across 


the university, as well as measured in vari- 
ous fields of society. “Using this platform, 
we will be able to cross data between 
and among fields, including humanities 
and social sciences dealing with ethical, 
legal and economic issues, to create new 
integrated fields of study that can produce 
social and economic value,’ explains 


Nishio. “Our focus on the contact point 
between science, technology and society 
will foster interdisciplinary research that 
deals with global challenges. We call 
open research.” 

Osaka University embarks on this open 
esearch revolution from a position a 
Japan’s most innovative university and 
among the most innovative institutions in 
he world, ahead of global heavyweights 
ike Caltech, Johns Hopkins University 
and the University of Cambridge, accord- 


- & 


n 


ing to Reuters 2015 Top 100 Innovative 
Universities. Osaka University’s ability to 
innovate through generating original re- 
search and creating useful technology with 
economic impact stems from its broad 
disciplinary spectrum and a long history of 
challenging convention. 

With its roots in the famous Kaidokudo 
and Tekijuku schools that were established 
in 1724 and 1838 respectively and pro- 
duced an extraordinary number of Japan’s 
modern luminaries, the ‘Osaka spirit’ has 


emboldened the university community as 
a place where future leaders can rise above 
fixed social standings to engage in solving 
he social issues. Today, this means taking 
he leading global research through open 
ialogue and innovative thinking. 

“Our university was founded with th 
support of civil society, and due to thi 
tradition, we are open to society and hav 
been actively promoting collaboratio 
with industry for many years,” explain 


Q 
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Nishio. “We were the first university in “At the IFReC, we have a number of 

Japan to establish joint collaborative labo- world-renowned researchers in the 

ratories on campus through our Industry _ field, including Shizuo Akira and Shimon 

on Campus initiative. The university is ex- Sakaguchi,” says Hoshino. “Both are re- 

panding in the more dynamic University- cipients of the Thomson Reuters Citation 

ndustry Co-Creation programme. Osaka Laureate prize and the prestigious Canada 

University continues to be one of the coun- Gairdner International Award.” Akira dis- 

ry’s leading institutions — we had the covered that receptors in the body’s cells 

highest ratio of joint papers with industry sense microbes and spur the immune 

among national comprehensive universi- system to combat infection and develop 

ies in Japan in 2014 and are active in local ong-term immunity; this discovery could 

and international patent registrations.” facilitate the development of therapies for active research institutes in the world in 
cancer and allergies. Sakaguchi is inves- he field of photon science and quantum 

Research with impact igating regulatory T cells as the immune beam technology. “We conduct world- 

As a comprehensive educational and cells that suppress immune reactions. His ass research in nanophotonics, power 


€ 
research institution, Osaka University work contributes to the understanding of photonics, plasma photonics, X-ray optics 
covers a full repertoire of academic dis- hese enigmatic immune cells and has and beam optics,” says Ryosuke Kodama, 
O 
p 


ciplines. Among them, the university has guided research into various physiological ne of the world’s leading researchers in 
cultivated particularly strong research and pathological immune responses. hoton science — a field that promises to 
teams in the fields of immunology, robot- Osaka University is also globally regard- yield innovative technologies with excit- 
ics and the emerging field of photon sci- ed for its cutting-edge research on robot- ing applications in industry and medicine. 
ence and technology. ics. Hiroshi Ishiguro, for example, is a world Researchers at Osaka University collabo- 
The university is a renowned interna- leader in the development of human-like rate extensively with other national and 
tional hub for immunology research and androids as a platform for research aimed international leaders in photon science, 
hosts the prestigious Immunology Frontie at understanding how people converse covering a diverse variety of fields includ- 
Research Center (IFReC), one of nine exclu- with and operate lifelike robots. While ing life science, materials science, high- 
sive World Premier International research shiguro’s team deals with human reac- energy-density science and high-energy 
centers in Japan that have been singled out ions, the group of Minoru Asada focuses physics, supported by the Harima Center 
by the government for significant long-term on the cognitive development of robots for Photon Science located at the SPring-8 
funding. “Since the beginning of its research hemselves. Asada’s research addresses national synchrotron facility. 
activities in 2008, IFReC has published more one of the most fundamental issues for “with the distinction of being se- 
han 1,000 papers in a wide variety of im- he future of robotics: how cognition and ected as a Top Global University by the 
munology fields, with a large proportion he identification of self and others can be Japanese government and Osaka’s rank- 
in high-impact international journals,” says developed in robots, and how this relates ing as Japan’s most innovative university,” 
executive vice president, Toshiya Hoshino. o the possibility of artificial empathy. This says Hoshino, “we have a responsibility 
FReC covers a broad range of research requires research on the full developmen- o fulfil our role as a leading national uni- 
fields from autoimmune diseases to malar- al trajectory, crossing over with neural versity by providing rich courses and pro- 
ia, osteoporosis and metabolic syndrome, dynamics as the basis for understand- grammes, nurturing global-mindedness, 
incorporating the latest measurement and ing the interactions between the body, and promoting research and innovation 
simulation technologies. The work of the brain and environment that generate in the most advanced science and tech- 
centre is aimed at devising new and more rich behaviours. nology, humanities and social sciences to 
efficient development strategies for vac- With more than 100 photonics laborato- solve issues of global concern. This is our 
cines and immune therapies. ries, Osaka University is among the most ‘open’ revolution.” 
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Contact 


E-mail: 
global-ou@mlL.office.osaka-u.ac.jp 
Tel: + 816 6105 5886 

Website: 
www.osaka-u.ac.jp/en/index.html 
resou.osaka-u.ac.jp/en/ 
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R ATIONAL SCIENCE INSTITUTE 


Crossing disciplines, 
borders and scales 


Pr 


ounded almost a century ago including the world’s first supercon- 
in 1917, today RIKEN is Japan’s ducting ring cyclotron and a super- 
largest research institution in the conducting radioactive-isotope beam 


natural sciences. It has a prestigious separator. The RIBF is a leading facility 
history of conducting research inawide for accelerator-based research into the 
range of specialized scientific areas. nature and origin of the nucleus and its 
Throughout its history, RIKEN has also elementary particles. 

strived to integrate research across At the Nishina Center, investigators 


disciplines to deepen our knowledge combine experimental research on par- 
of the comprehensive nature of our icles, atomic nuclei and solid-state ma- 
physical and biological environment. erials using protons, mesons, muons 
This vision has resulted in a better and other particles with theoretica 
U 
d 


nderstanding of how phenomena on methods for exploring the most funda- 


ifferent scales — from the atomic level mental laws of physics. This combined 
o the ecosystem and beyond — shape approach, together with a clear mandate 
he world we experience. o apply new elementary findings to 


Here, we take a closer look at two medicine, agriculture and other natura 
RIKEN centres at the forefront of sciences, sets the Nishina Center apart 


integrated research: the Nishina Center from other nuclear physics institutes. 
for Accelerator-Based Science and the According to Hideto En’yo, director 
Center for Life Science Technologies. of the centre, “The Nishina Center is 
also an exponent of the global reach 
Integrated nuclear physics research RIKEN has gained with its research ini- 
On the last day of 2015, RIKEN gave a iatives. The centre operates two facili- 
press conference (bottom photograph ies abroad. In the UK, we have a muon 
on the next page) to announce that just facility at the Rutherford Appleton 
hat morning, Kosuke Morita from the Laboratory that provides access to an 
ishina Centre for Accelerator-Based intense pulsed muon beam; with this 
Science had received news from the muon beam, researchers can study the 
nternational Union of Pure and Applied electromagnetic properties of insulat- 
Chemistry that his group had been ing, metallic, magnetic and supercon- 
ecognized as the discoverers of anew ducting materials. In the USA, we have 
chemical element, element 113. It was established a combined theoretical, 
a historic milestone — the first time an experimental and computing group 
element had been discovered in Asia. at Brookhaven National Laboratory’s 
This discovery was made possible by Relativistic Heavy lon Collider to train 
one of the most powerful and versatile young scientists in the physics of strong 
heavy ion accelerators in the world. interactions, including spin physics, 


The Nishina Centre’s RI Beam Factory, lattice quantum chromodynamics and 
where the research was conducted, is the physics of relativistic heavy-ion 
a tremendous instrument made from a collisions. Our facilities are used by sci- 
linear accelerator, a series of cyclotrons, entists around the world both for basic 
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esearch and applications.” of their regulatory mechanisms and 
The discovery of element 113 was a imaging the dynamics of their mo- 
first step in the centre’s long-term goal ecular interactions.” 
of attaining one of the holy grails of nu- The centre’s mission is driven by 
clear physics — reaching the so-called he close interactions among its three 
island of stability. The island of stabil- divisions: structural and synthetic biol- 
ity theory predicts that, for elements ogy, genomic technologies, and bio- 
heavier than uranium, a set of heavy iso- function dynamics imaging. Integration 
opes will be found exhibiting a magic of technologies and research outcomes 
combination of protons and neutrons from these three divisions allows 


hat results in increasing rather than CLST to chart a clear course towards 


decreasing stability. drug discovery, new therapies and 
According to En’yo, “We have already healthcare innovation. 

begun our search for elements 119 and One of the areas CLST focuses on is 

120, our next goals in the search for su- he characterization of disease through 

per-heavy elements. Other milestones he integrated and multimodal analysis 


on this quest include the discovery of structural, topological and functional 
of 300 new isotopes. So far, we have data that reveal the pathophysiology 


detected 150.” and molecular phenomena underly- 

Ultimately, the Nishina Center is ing pre-disease and disease states. 
on a quest to unravel the origin of “We have achieved much in this field, 
he Universe by better understand- not only in experimental animals, but 
ing the inner workings of the atomic also in humans by using a multiangle 
nucleus and thus reveal the processes approach — total omics analyses, in- 
by which heavy elements came to be. eraction studies of neuroimmunoen- 
“The RIBF is producing a variety of data docrine systems, positron emission 
on unstable nuclei, nuclear reactions, omography studies, functional mag- 
nuclear structures, shapes, sizes, as well netic resonance imaging studies and 
as many new and interesting features of magneto-encephalography studies,” 
exotic nuclei,” notes En’yo. says Watanabe. “The ultimate goal is to 

find ways to prevent disease by diag- 

Comprehensive living systems nosing it and initiating treatment before 
science its onset.” 
RIKEN’s Center for Life Science Translating new — scientific and collaborations also seek to improve 
Technologies (CLST) is another cham- medical insights gathered at CLST health outcomes on a more long-term 
pion of RIKEN’s focus on integrated into applications and therapies fol- ime scale. “A perfect example of this 
research facilities. It was established to ows multiple paths including strong is a recently approved project spear- 
advance the understanding of how liv- partnerships with industry, medical headed by CLST and comprising local 
ing systems work by integrating mecha- and academic institutions. And inter- governmental offices and more than 40 
nistic insights from the atomic scale nally, CLST provides support for several healthcare, industrial and academic col- 
through molecular function and cellular RIKEN translational initiatives such as aborators,” explains Watanabe. “This 
organization to the whole-body level. he RIKEN Program for Drug Discovery project aims to optimize personalized 
This goal is achieved by approaching re- and Medical Technology Platforms medicine approaches under the um- 
search from a dynamic and system-wide and the RIKEN Preventive Medicine & brella of the Compass for Healthy Life 
perspective, rather than a traditional Diagnosis Innovation Program. These Research Complex project.” 
static and reductionist angle. 

“Integration in life science means 
orchestration of a variety of molecules 
in some architectonic features — CLST Contact 
is mostly aiming to develop novel www.riken.jp/en 
methodologies and techniques to fol- www.riken.jp/en/careers 
ow this orchestration,” explains CLST’s www.facebook.com/RIKEN.english 
director, Yasuyoshi Watanabe. “These www.twitter.com/riken_en © 
echnologies enable the analysis of Tel: +81 48 462 1225 = 
big molecular complexes, exploration E-mail: pr@riken.jp RIKEN 
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Neutrinos and beyond 
— Opening a new era 
of cosmic-ray research 


a, 


Takaaki Kajita, a recipient of the 2015 hat neutrinos are massless. These discov- 
Nobel Prize in Physics, and other re- eries, which resulted in Kajita’s 2015 Nobel 
searchers at the University of Tokyo’s Prize in Physics, were made a team led by 
Institute for Cosmic Ray Research (ICRR) him in 1998. 
have been exploring new realms in par- Kajita acknowledged his success owed 
ticle physics research. Kajita’s work on a lot to the strong support he received 
neutrinos and related research at ICRR from two mentors and former supervisors 
is leading the world in this field. — Masatoshi Koshiba and Yoji Totsuka. 
oshiba was awarded the 2002 Nobel Prize 
The path to a Nobel prize in Physics for detecting neutrinos pro- 
Particle physics and astrophysics are duced in supernovae using Kamiokande, 
among the most active research fields at he predecessor of Super-Kamiokande. 
he University of Tokyo, and its Institute for Totsuka led the Super-Kamiokande proj- 
Cosmic Ray Research (ICRR) is leading the ect as Koshiba’s successor. Totsuka’s con- 
world with explorations in these areas. ICRR ribution was so great that many believe 
is best known for its research on neutrinos he would have shared the Nobel Prize with 
using the world’s largest underground neu- ajita if he were alive. 
rino detector, Super-Kamiokande. The de- Kajita’s award-winning work dates 
ector is located in a mine in central Japan back to 1986 when he earned his PhD for 
and is filled with 50,000 tons of pure water. researching proton decay. Soon after that, 
eutrinos, which are electrically neutral ajita began to upgrade the software he 
subatomic particles, exist in three different was using to separate the proton-decay sig- 
forms — electron, muon and tau. They are nal from the background noise produced 
generated in various places, including the by atmospheric neutrino interactions. He 
centre of the Sun and the Earth’s atmo- discovered that his data did not match the- 
sphere. Although neutrinos are the most oretical calculations and began to search 
abundant particle in space after photons, for the cause of this discrepancy. He was 
hey are hard to observe because they unable to find any problems with the data 


b 
rarely interact with matter. But they hold analysis or the simulation of atmospheric 
tr 


he key to many questions about how the neutrino interactions. The Kamiokande 


Universe was formed. collaboration reported this result in 1988, 

Researchers at ICRR have discovered but the academic community was scepti- 
many important properties of these enig- cal. “The result disagreed with those ob- 
matic particles, including the fact that neu- ained by other researchers.” Kajita recalls. 
rinos oscillate, or change from one type to “But Koshiba and Totsuka understood the 
another, as they travel through the Earth. importance of the Kamiokande data and 


One implication of this finding is that neu- gave me constant support.” 
rinos have mass — a conclusion that has The breakthrough came in 1996 when 
forced scientists to reassess the standard Super-Kamiokande began operating; with- 


model of particle physics, which predicts in two years Kajita’s team had definitively 
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proved that neutrinos oscillated. They 
presented this finding at the International 
Conference on Neutrino Physics and 
Astrophysics. “To my surprise, the audi- 
ence welcomed my presentation and gave 


mea standing ovation,” Kajita says. 


More projects with Super-Kamiokande 
Since then, neutrino research has been 
steadily advancing. Some 120 research- 
ers from seven countries take turns to 
analyse the data that is being continually 
generated by Super-Kamiokande. The fa- 
cility is operated under the supervision 
of Masayuki Nakahata, head of Kamioka 
Observatory and solar neutrino specialist. 
eanwhile, Yoichiro Suzuki, deputy direc- 
or of the Kavli Institute for the Physics 
and Mathematics of the Universe (Kavli 
PMU) at the University of Tokyo, greatly 
contributed to explaining the discrepancy 
between data and theory for solar neu- 
rinos, which had long puzzled research- 
ers. Suzuki and Kajita were awarded the 
2016 Breakthrough Prize in Fundamental 
Physics for their roles in the discovery and 
study of neutrino oscillation. 
Neutrino research will be further acceler- 
ated in the next few years with the upgrade 
of Super-Kamiokande, which will allow it 
to detect supernova neutrinos more ef- 
ficiently. There are also plans to build an- 
other detector, Hyper-Kaimokande, whose 
volume will be 20 times greater than that 
of Super-Kamiokande. This mega-detector 
is expected to begin operation in 2025 
and the discovery of particle-antiparticle 
asymmetry in neutrinos is one of the ma- 


jor goals for it. 


Harvard University 


Stanford University 


University of Tokyo 
University of Oxford 


University of Cambridge 


Gamma rays and gravitational waves 
CRR’s exploration of new research fields 
goes beyond neutrinos. The institute is 
participating in the international project to 
build the Cherenkov Telescope Array (CTA), 
a next-generation observatory of very high- 
energy gamma rays. CTA will detect the 
highest-energy photons ever observed, 


which will lead to a deeper the understand- 
ing of star formation in the evolution of the 
Universe and many other events that can- 
not be observed using current technolo- 
gies. In the CTA project, ICRR’s Cherenkov 
Cosmic Gamma Ray Group is developing 
arge-scale telescopes in collaboration with 
researchers in Japan, the USA and Europe. 

Another important project at ICRR is 
he Large-Scale Cryogenic Gravitational 
Wave Telescope (KAGRA). The first-stage 
facility was completed in October 2015. 


Gravitational waves are the curvature of 
space-time and they propagate at the 
speed of light according to Einstein’s 
heory of relativity. On 11 February 2016, 
he Laser Interferometer Gravitational- 
Wave Observatory (LIGO) in the USA and 
its international collaborators made the 
momentous announcement that they had 
detected gravitational waves. “Detection o 
gravitational waves is very difficult, but the 
esearch — once thought impossible — is 
moving on thanks to the steady advance o 


science and technology and the endeavors 
of many researchers,” says Shinji Miyoki, as- 
sociate professor of ICRR and a member of 
the KAGRA project. 

Researchers are now testing the laser 
interferometer and plan to commence its 
operation with cryogenic mirrors within 
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Nature Index ranking 


=> The publication output of the University of Tokyo is ranked fourth in the 
world in terms of the weighted fractional count (data period: 1 December 


2014-30 November 2015). 
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he next year. Already, more than 200 re- 
searchers from around the world are par- 
icipating in the KAGRA project. 

Worldwide collaboration in this area 
is expected to accelerate. ICRR is poised 
o contribute to this new field of gravita- 
ional-wave astronomy by completing the 
construction of KAGRA and participating 


in international observation networks as 
soon as possible. Through collaborating 
with researchers around the world, ICRR 
aims to elucidate the mysteries of the 
Universe, by, for example, detecting the 
birth of black holes formed by the merger 
of binary neutron stars. 


Promoting cross-border cooperation 

All ICRR’s projects are undergirded by the 
efforts of graduate students and young re- 
searchers, who perform vital activities rang- 


ng from the construction of equipment to 
data analysis. This is true for all the research 
done at the University of Tokyo. Young 
scientists are essential for research, and 
they sometimes create new research fields. 
As a world hub for knowledge collabora- 
tion, the University of Tokyo is supporting 
diverse research activities and developing 
an environment where young researchers 
can explore without constraints. To this 
end, the university makes proposals to 

d 

g 

g 


he government as well as reviewing an 
reforming internal systems and publicizin 
its research activities. Support for youn 
scientists, who will play a key role in next- 
generation research, is vital and should be 
made available by both universities and 
governments. As collaborative projec 
between researchers in different countrie 
are becoming more prolific, it is crucial to 
create an environment in which research- 
ers can flexibly and actively participa 
in global projects, interact with scientis 
around the world and create new values. 
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Tokyo's strat 


Established in 2005 but with roots 
dating back more than 80 years, 
Tokyo Metropolitan University (TMU) 
serves not just its students but also 
the metropolis of Tokyo. With the es- 
tablishment of eight new world-class 
research centres, TMU is set to help 
shape the future of Tokyo and also of 
the world beyond. 


anked among Japan’s top universi- 
ties, Tokyo Metropolitan University 
is highly regarded for the quality 
and relevance of its research. Owned by 
the Tokyo Metropolitan Government, 
TMU effectively serves as the govern- 
ment’s think tank for solving some of the 
many challenges faced by the megacity 
of Tokyo. This unique relationship allows 
TMU and its researchers to contribu 
directly to the future of the metropolis 
very tangible ways. 
Pursuing ‘the ideals of metropolitan 
society, TMU through its 28 academic 
departments focuses on the issues and 
disciplines relevant to megacities in gen- 
eral — such as urban infrastructure, ener- 
gy and environment, health and welfare, 
and population ageing — as well as topics 
more specific to Tokyo, such as advanced 
disaster prevention and management. 


So 
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The hosting of the 2020 Olympic and 
Paralympic Games by Tokyo has brought 
he important relationship between TMU 
and the Tokyo Metropolitan Government 
o the fore. TMU has been charged with 
developing some of the key technologies 
hat promise to make the 2020 Games 
uly unique, engaging in research proj- 
ects in areas including smart communica- 
ions, advanced disaster prevention and 
welfare, urban environment, volunteer- 
ing, regional ties and sports. The projects 
emphasize the practical application of 
research in connection with the city, gov- 
ernmental and municipal organizations 
foreign and domestic research institu- 
tions, and business. 

To stimulate and accelerate specific 
areas of world-class research, TMU began 
establishing dedicated research centres 


in 2014 with generous funding from the 
metropolitan governmen 
now has 12 research centres specializing 
in space science; genomics and _bioin- 
formatics; artificial photosynthesis; gold 
chemistry; language, brain and genetics; 
water system engineering; community- 
centric systems; climatology; big social 
data; child and adolescent poverty; quan- 
titative finance; and building a hydrogen 
energy society. 
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egic partner 


Masatake Haruta, director of the Research 
Center for Gold Chemistry. 


Going for gold 

The first of the new research centres to 
be established was the Research Center 
for Gold Chemistry. It is housed in its own 
new building and is under the guidance of 
one of TMU’s most respected professors, 
asatake Haruta. “In 2012, TMU hosted 
he Sixth International Conference 
on Gold Science, Technology and its 
Applications, and this was in many ways 
he catalyst for our new research centre,” 
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so at lower temperatures than 
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in the world,” says Kiyoshi Kanamura, 
irector of T 


U’s new Research Center 
Hydrogen Energy Society. 
green energy system us- 
fuel in Tokyo, we need to 
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drogen refueling stations, and hydrogen 
production and carrier systems. Our new 
centre will provide comprehen- 
cross-cutting research for the 


a hydrogen energy society 
e range of disciplines.” 
of the hydrogen research 


centre, the 12th to be established under 
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Research Centres 


for: 


e Space Science 


e Genomics and Bioinformatics 


e Artificial Photosynthesis 
e Gold Chemistry 


e Language, Brain and Genetics 


e Water System Engineering 


e Community-Centric Systems 


e Climatology 
e _ Social Big Data 


e Child and Adolescent Poverty 


e Quantitative Finance 


e __ Building Hydrogen Energy 
Society 


TMU’s new research centre directi 
coordinates closely with that of t 


research centres for artificial photos 
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gen, such as biomass and solar ene 


hesis and gold chemistry. “We want 
se natural energy to produce hydro- 


in order to realize a CO,-free hydrogen 
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The Tokyo Metropolitan Governm 
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production system,” says Kanamura. 


o exhibiting Japan’ 
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t the 2020 Olympics, and the TMU 
hrough this research centre will strive to 
make that a reality. The outcomes from 
he centre, the fundamental results and 
now-how, will then be transferred to 


society to the wor 


other megacities around the world.” 
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Tokyo University of Science 


Building a better future 
with science 


Strong ethics and an emphasis on 
multidisciplinary research underpin 
efforts at Tokyo University of Science 
to solve global challenges 
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TUS's mission for education and research is to use science and technology for the sake of nature, humanity, society and their harmonious development. 


Tokyo University of Science (TUS) was founded in 1881 and is one of the oldest science and technology focused 
private universities in Japan. At its four campuses in the Tokyo metropolitan area and at its sister university TUS, 
Suwa, TUS researchers concentrate on their mission to build a better future with science. 


PAPERS 
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= NATURE INDEX 
IN 2014 


TUS IS IN THE: 


TOP 


6% 


of global institutions in the Nature Index* 


*based on WFC in the ‘overall’ category, using 2014 data 


SUBJECT 
AREAS 


HH CHEMISTRY 
I LIFE SCIENCES 
HM PHYSICAL SCIENCES 


TUS IS STRONGEST IN 


CHEMISTRY 


* based on WFC 2014 


Co-authored papers with 245 
collaborating institutions/companies 
in the Nature Index 


SOCIAL IMPACT 


DETECTION OF URANIUM AND CHEMICAL STATE 
ANALYSIS OF INDIVIDUAL RADIOACTIVE 
MICROPARTICLES EMITTED FROM THE FUKUSHIMA 
NUCLEAR ACCIDENT USING MULTIPLE 
SYNCHROTRON RADIATION 

X-RAY ANALYSES 


JOURNAL: 


ANALYTICAL CHEMISTRY 


PUBLISHED: 
2014 


* date obtained 9/12/2015 altmetric.com/details/2572954 


TOP 5 INTERNATIONAL 
COLLABORATORS 


seRSITY, 


&@) TOKYO UNIVERSITY OF SCIENCE 


nce 


1. Helmholtz Association of German Research Centres, Germany [1.25] 
2. Vanderbilt University (VU), United States of America (USA) [1] 

3. BASF SE, Germany [1] 

4. Chinese Academy of Sciences (CAS), China [0.95] 


5. University of Szeged, Hungary [0.86] 
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UNIVERSITY OF TSUKUBA: 
Transborder collaboration 


for a brighter future 


As a leading national research university 
at the heart of Tsukuba Science City, the 
University of Tsukuba is spearheading 
transborder collaboration across all or- 
ganizational, institutional and national 
barriers in order to promote transdisci- 
plinary research and education to realize 
breakthrough knowledge and solutions. 


he University of Tsukuba was found- 

ed in 1872 as modern Japan’s first 

state higher education institution. 
It was reborn 42 years ago as the anchor 
institution of the country’s premier Science 
City with the goal of promoting transdisci- 
plinary education and research and being 
open to society. Responding to the trans- 
ormation of today’s society, the university 
is embarking on the next stage of its evolu- 
tion by adopting all-encompassing reforms 
to become a transborder university. These 
eforms are based on the conviction that 
the complex issues of the twenty-first cen- 
tury require global collaboration. 


Transborder education 
Two of the university’s most ambitious 
platforms are its Campus-in-Campus 
nitiative and Re-inventing Japan Program. 
The Campus-in-Campus Initiative pro- 
motes campus sharing with partners of the 
university, allowing students and research 
ers full access to global resources while 
promoting collaboration across nationa 
borders. Joint education programmes wi 
be developed, and preeminent interna- 
tional researchers will be co-appointed. 


l 
l 
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The Re-inventing Japan Program is a 
government-funded scheme for develop- 
ing cooperative programmes targeting 
specific regions. The university has been 
awarded funding for programmes focused 
on Europe, South-East Asia, Russia and 
Central Asia, and South America. 


World-class research and innovation 

The University of Tsukuba strives to create 
new disciplines through promoting wide- 
ranging collaboration in both basic science 
and innovation. Its Center for Computational 
Science has received worldwide recognition 
— including two Gordon Bell prizes — for its 
supercomputer architecture. It is promoting 
multidisciplinary computational science 
through powering research that spans par- 
ticle physics, astrophysics, nuclear physics, 
nanoscience, life and environmental sci- 
ence, and information science. The Tsukuba 
Advance Research Alliance is seeking to un- 
ravel the mysteries of life. For example, it is 
exploring the molecular mechanism of lifes- 
pan and the control of immune response 
by regulating cellular activation and deac- 
ivation. Through the Campus-in-Campus 
Initiative, the university has established 
research units led by international principal 
investigators in the fields of cancer research 
and Buddhist philosophy. 
Transborder collaboration has led to 
innovations that are set to spawn new 
industries. Cybernic technology, which 
integrates neuroscience, physiology, robot- 
ics, computer science, medical science and 
engineering in cyborg-type medical robots 


(HAL), is pioneering nerve regeneration us- 
ing interactive biofeedback. Algal biomass 
development research has resulted in the 
discovery of two algal species with hy- 
drocarbon-producing capabilities 10-100 
imes greater than higher plants, making 
hem a highly attractive potential alterna- 
ive to petroleum. The products obtained 
from these algae are also very promising 
for use in pharmaceuticals and cosmetics. 
In addition, the Next-Generation Socia 
System Project seeks to promote collabora- 
ion across the academia-industry divide 
by realizing a sustainable mobile society in 
conjunction with Toyota Motor Corporation. 


Legacy of global collaboration 

As Japan’s top sports science univer- 
sity, Tsukuba is supporting the 2020 
Tokyo Olympic and Paralympic Games in 
all areas. This involvement is a legacy of 
Jigoro Kano, one of its founding principals 
and the father of modern judo. His motto 
“Mutual prosperity for self and others” still 
resonates as the university endeavours to 
secure a brighter future for all. 


University of Tsukuba 
FURKAS 
Contact 


1-1-1 Tennodai, Tsukuba, Ibaraki 
305-8577, Japan 


Phone: +81-29-853-2111 
Web: www.tsukuba.ac.jp/en 
Facebook: 


www.facebook.com/univ.tsukuba.en 
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Located in central Tokyo, Waseda 
University is one of Japan’s leading 
teaching and research universities. 
Since its foundation in 1882, Waseda 
has been preparing students for lead- 
ership in an ever-changing world. The 
university’s innovative spirit guides the 
work of approximately 600,000 alumni 
in more than 100 countries. 


aseda University is known for 
its entrepreneurial character, 
passed through the legacy 
of its founder, Shigenobu Ohkuma, who 
emphasized the importance of maintain- 
ing an enterprising spirit and a critical 
mind. The university's open and diverse 
spirit has produced countless leaders 
n 


industry, academia and government, 
including seven prime ministers, and the 

nders and CEOs of many multinational 
companies, such as Sony, Samsung and 
Uniqlo-brand operator Fast Retailing. The 
university ranked 33rd in the world and the 
op in Japan in the graduate employability 
ankings released in December 2015 by 
Quacquarelli Symonds. 

“Our aim is to create unique people, 
who may not fit in at conventional 
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organizations, but who challenge ortho- 
dox thinking, do something interesting 
and stand out in their fields,’ says Waseda 
University president, Kaoru Kamata. “Their 
spirit is born out of our multicultural diver- 
sity, which Waseda is proud of,” he adds. 
Waseda has more than 53,000 students, 
over 5,000 of whom are from overseas, hail- 
ing from more than 100 countries, making 
Waseda the most international campus 
in Japan according to Japan Student 
Services Organization (JASSO) data. JASSO 
also ranks the university top in Japan for 
sending its students abroad for study. 


Atruly global university 
Unlike many Japanese universities, which 
have only recently started accepting 


students from overseas, Waseda has a 
ong tradition and established systems to 
help international students enjoy learn- 
ing on campus. Just two years after its 
foundation in 1882, Waseda received its 
first overseas student from Korea, who 
was followed by the first Chinese student 
in 1899. The university’s ties with China 
have been particularly strong since the 
ate nineteenth century, as many who 
had studied at Waseda helped shape the 


Chinese Revolution of 1911, including a 
famous Japanese revolutionary and phi- 
osopher, Toten Miyazaki, who supported 
Sun Yat-sen, China’s first president. 
Other figures with strong ties to Waseda 
have also been active in creating cultural 
bridges between Japan and other nations. 
For example, Ryusaku Tsunoda, an alum- 
nus of the inaugural class, established 
he Japanese Culture Center at Columbia 
University in 1929, globalizing Japanese 
studies by melding Japanese and Western 
perspectives and approaches. 

“Since our foundation, we have looked 


to the world stage,” says Kamata. “We want 

students to harness what they have learned 

at Waseda for the benefit of the world.” 
Currently, six undergraduate schools 


and ten graduate schools, or 2,400 
courses, are taught in English. The num- 
ber of programmes that allow students 
o obtain degrees entirely in English now 
stands at 50. 
Waseda has also formed some 700 part- 
nerships with higher-education institutions 
in 79 countries, allowing students to attend 
classes or even obtain degrees from other 
universities, including Columbia University 
and Peking University. The university has 


also upgraded its curriculum, inviting pro- 


fessors from institutions 


to offer small and 


well as introducing a 


across the world 
interactive classes, as 
four-term calendar, 


rather than the calendar year starting in 
April used by most Japanese universities. 


Waseda has also launched summer ses- 


sions to give students 
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Frontline research 
One of Waseda’s greatest strengths is 
its Faculty of Science and Engineering, 
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partner- 
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